Expression of Heterologous Protein

The invention relates to the manufacture of recombinant proteins; vectors for use in
the recombinant manufacture of protein; and recombinant proteins that are adapted for
purification.

The large scale production of recombinant proteins requires a high standard of
quality control since many of these proteins are used as pharmaceuticals, for
example: growth hormone; leptin; erythropoietin; prolactin; interleukins; interferons.

Moreover, the development of vaccines, particularly subunit vaccines (vaccines
based on a defined antigen, for example gp120 of HIV), requires the production of
large amounts of pure protein free from contaminating antigens which may provoke
anaphylaxis. The production of recombinant protein in cell expression systems is
based either on prokaryotic cell expression or eukaryotic cell expression. The latter
is preferred when post-translational modifications to the protein are required.
Eukaryotic systems include the use of mammalian cells, e.g. Chinese Hamster Ovary
cells; insect cells, e.g. Spodoptera spp; fungal/yeast cells (e.g. Saccharomyces spp,
Pichia spp, Aspergillus spp); and plants.

The production of heterologous recombinant proteins in an expression system relates
to a variety of proteins used in both industrial and medical applications.

For example, "therapeutic polypeptides" include tumour suppressor polypeptides
(e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1
polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastoma
polypeptide (Lee, et al (1987) Nature 329:642), the MMAC-1 polypeptide, the
adenomatous polyposis coli protein (United States Patent 5,783,666), the deleted in
colon carcinoma (DCC) polypeptide, the MMSC-2 polypeptide, the NF-1
polypeptide, nasopharyngeal carcinoma tumour suppressor polypeptide (Cheng, et al
1998. Proc. Natl. Acad. Sci. 95:3042-3047), the MTS1 polypeptide, the CDK4
polypeptide, the NF-1 polypeptide, the NF2 polypeptide, and the VHL polypeptide).

Therapeutic polypeptides also include "antigenic polypeptides" (e.g. tumour rejection
antigens such as the MAGE, BAGE, GAGE and DAGE families of tumour rejection
antigens, see Schulz et al Proc Natl Acad Sci USA, 1991, 88, pp991-993). Antigenic
polypeptides also include polypeptide antigens used in the preparation of vaccines
which provide protection against infectious agents, for example viruses such as
Human Immunodeficiency Virus (HIV 1 & 2); Human T Cell Leukaemia Virus
(HTLV 1 & 2); Ebola virus; Human Papilloma Virus (e.g. HPV-2, HPV-5, HPV-8,
HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56); papovavirus;
rhinovirus; poliovirus; herpesvirus; adenovirus; Epstein Barr virus; influenza virus;
hepatitis B and C viruses. Also included are antigens derived from pathogenic
bacteria such as Staphylococcus aureus; Staphylococcus epidermidis; Enterococcus
faecalis; Mycobacterium tuberculosis; Streptococcus group B; Streptococcus
pneumoniae; Helicobacter pylori; Neisseria gonorrhoeae; Streptococcus group A;
Borrelia burgdorferi; Coccidioides immitis; Histoplasma capsulatum; Neisseria
meningitidis; Shigella flexneri; Escherichia coli; Haemophilus influenzae; antigens
derived from parasites such as Trypanosoma spp, Plasmodia spp, Schistosoma spp;
and antigens derived from pathogenic fungi such as Candida spp.

The development of subunit vaccines (e.g. vaccines in which the immunogen is a
purified protein) has been the focus of considerable research in recent years and
requires the isolation and purification of the candidate protein from a complex
mixture of proteins. The emergence of new pathogens and the growth of antibiotic
resistance have created a need to develop new vaccines and to identify further
candidate molecules useful in the development of subunit vaccines. There are a
number of routes via which vaccines may be administered to an animal. The
traditional route is by transdermal injection of the vaccine containing an immunogen
that invokes an immune response. An alternative route that is not typically
undertaken is an orally administered vaccine. The immunogen contacts the immune
system via the mammalian mucosal membrane which lines the small intestine
through the activity of so called M cells. The M cells overlie regions of the intestine
referred to as Peyer's patches and function to sample antigens contained in ingested
material and are therefore essential components in mediating mucosal immunity.
Peyer's patches have been identified as major sites for the adherence and entry of
enteric pathogens such as Escherichia, Salmonella, Yersinia and Shigella.

Plant derived vaccines have been developed which utilise plant oil bodies which have
been engineered to present an antigen to which an immune response is desired, see
WO01/95934. Plants that bear oilseeds such as soybean, rapeseed and sunflower
store triglycerides in their seeds which provide an energy store for germination and
subsequent seedling development. Further examples of this are described in
WO93/20216; WO96/21029 and WO98/21115.

Therapeutic polypeptides can be "cytotoxic polypeptides" (e.g. pseudomonas
exotoxin, ricin toxin, diphtheria toxin); or polypeptides which are "cytostatic
polypeptides" (e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide,
cyclin dependent kinase inhibitors such as p16, p15, p18 and p19, the growth arrest
specific homeobox (GAX) polypeptide as described in Branellec, et al, see
WO97/16459 and WO96/30385).

Also included within the scope of therapeutic polypeptides are therapeutic antibodies.
Preferably said antibodies are monoclonal antibodies or at least the active binding
fragments thereof. Monoclonal antibodies may be humanised or chimeric antibodies.
A chimeric antibody is produced by recombinant methods to contain the variable
region of an antibody with an invariant or constant region of a human antibody. A
humanised antibody is produced by recombinant methods to combine the
complementarity determining regions (CDRs) of an antibody with both the constant
(C) regions and the framework regions from the variable (V) regions of a human
antibody.

Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides
in their seeds which provide an energy store for germination and subsequent seedling
development. In the seeds the oil globules form oil bodies in the cytoplasm that are
membrane bound organelles that have protein in the membrane surface. An example
of a protein found associated with oil bodies is an oleosin. The oleosins are a family
of proteins that have divergent amino and carboxyl termini with a more highly
conserved lipophilic central domain. These proteins are strongly associated with oil
bodies and it is thought that the association is due in part to the lipophilic domain.

We have cloned two plant lipases referred to as OBL1 and OBL2 from castor bean
that have different lipid specificities. The sequences of OBL1 and OBL2 have
revealed a lipophilic domain that lacks any homology to that found in the oleosin
described in WO93/21320 and that represents an alternative means to localise
polypeptides which comprise this sequence to oil bodies and thereby facilitate their
purification.

According to an aspect of the invention there is provided a nucleic acid molecule that
encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first
part selected from the group consisting of:

i) a nucleic acid molecule consisting of a nucleic acid sequence which
encodes an amino acid sequence as represented in Figure 3;

ii) a nucleic acid molecule which hybridises to the nucleic acid molecule
in (i) and which specifically targets said polypeptide to an oil body;
and

iii) a nucleic acid molecule which differs from the nucleic acid molecules
of (i) and (ii) due to the degeneracy in the genetic code;

and a second part that encodes an heterologous polypeptide.
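Degeneracy in the genetic code means that different nucleic acid sequences can encode the same amino acid sequence. The following is a minimal sketch of how that equivalence could be checked, assuming the standard genetic code; the function names and the short test sequences are hypothetical and are not taken from Figure 3.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encode_same_polypeptide(dna_a, dna_b):
    """True if two DNA sequences differ only through codon degeneracy."""
    return translate(dna_a) == translate(dna_b)


# Hypothetical example: two different codon choices for the peptide A-K-P.
print(encode_same_polypeptide("GCTAAACCG", "GCCAAGCCA"))
```

Two sequences that pass this check are degenerate variants of each other in the sense of part (iii); a sequence in which a codon change alters an amino acid would fail it.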

An heterologous polypeptide is a polypeptide to which a targeting domain as herein
disclosed has been fused which would not naturally include such a targeting domain.

In a preferred embodiment of the invention said targeting domain is encoded by a
nucleic acid molecule comprising a nucleic acid sequence which hybridises under
stringent hybridisation conditions to a nucleic acid molecule which encodes an amino
acid sequence as represented in Figure 3.

Stringent hybridisation/washing conditions are well known in the art. For example,
stringent conditions are those under which nucleic acid hybrids are stable after
washing in 0.1x SSC, 0.1% SDS at 60°C. It is well known in the art that optimal
hybridisation conditions can be calculated if the sequence of the nucleic acid is
known. Typically, hybridisation conditions use 4-6x SSPE (20x SSPE contains
175.3g NaCl, 88.2g NaH2PO4.H2O and 7.4g EDTA dissolved to 1 litre and the pH
adjusted to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g
Ficoll (type 400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum
albumin); 100µg-1.0mg/ml sonicated salmon/herring DNA; 0.1-1.0% sodium
dodecyl sulphate; optionally 40-60% deionised formamide.
Hybridisation temperature will vary depending on the GC content of the nucleic acid
target sequence but will typically be between 42°C and 65°C.
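The dependence of hybridisation temperature on GC content can be illustrated with a short calculation. The sketch below is not taken from this specification: it assumes a widely quoted empirical approximation for long DNA:DNA hybrids (commonly attributed to Meinkoth and Wahl), and the function and parameter names are hypothetical.

```python
import math


def gc_percent(seq):
    """Percentage of G and C residues in a nucleic acid sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def estimate_tm(seq, sodium_molar, formamide_percent=0.0):
    """Estimate the melting temperature (degrees C) of a long DNA:DNA hybrid.

    Assumed approximation (not from the source text):
    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 0.61*(%formamide) - 500/length
    """
    return (
        81.5
        + 16.6 * math.log10(sodium_molar)
        + 0.41 * gc_percent(seq)
        - 0.61 * formamide_percent
        - 500.0 / len(seq)
    )


# Hypothetical 500-nucleotide target with 50% GC content.
target = "GC" * 125 + "AT" * 125
print(round(estimate_tm(target, sodium_molar=1.0), 1))
print(round(estimate_tm(target, sodium_molar=1.0, formamide_percent=50.0), 1))
```

Under this approximation a higher GC content raises the estimated melting temperature while formamide lowers it, which is consistent with formamide being listed above as an optional component.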

In a preferred embodiment of the invention said nucleic acid molecule comprises a
nucleic acid sequence that has at least 30% homology to the nucleic acid sequence
which encodes an amino acid sequence as represented in Figure 3. Preferably said
homology is at least 40%; 50%; 60%; 70%; 80%; 90%; or at least 99% identity with
the nucleic acid sequence that encodes the amino acid sequence as represented in
Figure 3.
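The percentage thresholds above can be made concrete with a simple identity calculation. This is a minimal sketch assuming the two sequences have already been aligned to equal length, with "-" marking gaps; the specification does not state which alignment method or scoring convention is intended, so the convention used here is an assumption.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical aligned fragments differing at one position in ten.
print(percent_identity("ATGCATGCAT", "ATGCATGCAA"))
```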

According to a further aspect of the invention there is provided a chimeric
polypeptide encoded by a nucleic acid molecule according to the invention.

Recombinant protein production relates to the synthesis of protein in expression
systems (e.g. bacterial expression systems, yeast expression systems, insect
expression systems, mammalian expression systems and plant expression systems).
Heterologous polypeptides include commercially important polypeptides, for
example enzymes used in biocatalysis (e.g. restriction enzymes, and enzymes used in
industrial processing, e.g. amylases, proteases, nucleases, lipases) and therapeutic
polypeptides as herein disclosed. The large scale production of recombinant proteins
requires a high standard of quality control since many of these proteins are used as
pharmaceuticals, for example, interleukins, growth hormone, erythropoietin,
interferon. Moreover, the development of vaccines, particularly subunit vaccines
(vaccines based on a defined antigen, for example gp120 of HIV), requires the
production of large amounts of pure protein free from contaminating antigens which
may provoke anaphylaxis.

In a preferred embodiment of the invention said second part encodes an enzyme. 

In a further preferred embodiment of the invention said second part encodes a 
pharmaceutical polypeptide. 

In a yet further preferred embodiment of the invention said second part encodes an
antigenic polypeptide. Preferably said second part encodes an antigenic polypeptide
associated with an infectious disease condition.

In a preferred embodiment of the invention said second part encodes a polypeptide
associated with a viral disease condition.

In a preferred embodiment of the invention said viral disease is the result of an
infection caused by a virus selected from the families consisting of: Picornaviridae;
Caliciviridae; Togaviridae; Flaviviridae; Coronaviridae; Rhabdoviridae; Filoviridae;
Paramyxoviridae; Orthomyxoviridae; Bunyaviridae; Arenaviridae; Reoviridae;
Birnaviridae; Retroviridae; Hepadnaviridae; Parvoviridae; Papovaviridae; Adenoviridae;
Herpesviridae; and Poxviridae.

In a preferred embodiment of the invention said virus is selected from the families
consisting of: Adenoviridae; Coronaviridae; Herpesviridae; Orthomyxoviridae;
Picornaviridae; Poxviridae; Reoviridae; and Retroviridae.

In a further preferred embodiment of the invention said second part encodes a
polypeptide associated with a bacterial disease condition.

In a preferred embodiment of the invention said bacterial disease is caused by a
bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas;
Bacillus; Bacteroides; Bordetella; Brucella; Campylobacter; Capnocytophaga;
Chlamydia; Clostridium; Corynebacterium; Eikenella; Erysipelothrix; Escherichia;
Fusobacterium; Haemophilus; Klebsiella; Legionella; Leptospira; Listeria;
Mycobacterium; Mycoplasma; Neisseria; Nocardia; Pasteurella; Proteus;
Pseudomonas; Rickettsia; Salmonella; Selenomonas; Shigella; Staphylococcus;
Streptococcus; Treponema; Vibrio and Yersinia.

In a further preferred embodiment of the invention said second part encodes a 
polypeptide associated with a fungal disease condition. 

In a preferred embodiment of the invention said fungal disease condition is caused by
a fungal pathogen selected from the group consisting of: Aspergillus; Blastomyces;
Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.

In a further preferred embodiment of the invention said second part encodes a
polypeptide associated with a parasitic disease condition.

In a preferred embodiment of the invention said parasitic disease condition is caused
by a parasitic pathogen selected from the group consisting of: Trypanosoma;
Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.

In a preferred embodiment of the invention said parasitic pathogen is selected from 
the group consisting of: Entamoeba; Eimeria; and Trichomonas. 

In an alternative preferred embodiment of the invention said second part encodes a
polypeptide associated with cancer.

In a preferred embodiment of the invention said polypeptide is a tumour rejection
antigen polypeptide.

In an alternative preferred embodiment of the invention said second part encodes a
polypeptide that is an adhesive polypeptide; preferably said adhesive polypeptide is a
mussel adhesive polypeptide.

Mussel-derived adhesives possess a variety of characteristics that make them of
particular interest to material scientists. The adhesives display high bonding
strength, versatility across a variety of bonding surfaces, resistance and durability,
non-toxicity, waterproofness and the ability to cure without high temperatures,
characteristics which give these adhesives a range of potential uses and which no
man-made adhesive currently on the market possesses. These properties have led to
predictions of wide-ranging applications in engineering and medicine. For example, a
requirement for a natural, strong and waterproof adhesive can be found in the marine
industry, where it can be used in ship-building. There are also medical uses, where
such an adhesive, with its further advantages of being neither cytotoxic nor antigenic,
could be used in orthopaedic and dental applications.

In a preferred embodiment of the invention said mussel adhesive polypeptide is
encoded by a nucleic acid molecule selected from the group consisting of:

i) a nucleic acid molecule which encodes a polypeptide comprising the amino
acid motif

X-K-X-X-Y-P-X-X-Y-K
wherein X is any amino acid residue;

ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of
the residues in said motif is modified by hydroxylation; and

iii) a nucleic acid molecule which encodes a polypeptide which has adhesive
properties.
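The motif X-K-X-X-Y-P-X-X-Y-K, in which X is any amino acid residue, maps naturally onto a pattern search over a one-letter amino acid sequence. The following is a minimal sketch under that reading; the function names and the test sequences are hypothetical, and post-translational hydroxylation is not modelled.

```python
import re

# X-K-X-X-Y-P-X-X-Y-K, with X standing for any single residue letter.
ADHESIVE_MOTIF = re.compile(r"[A-Z]K[A-Z]{2}YP[A-Z]{2}YK")


def motif_positions(protein):
    """Return 0-based start positions of non-overlapping motif matches."""
    return [m.start() for m in ADHESIVE_MOTIF.finditer(protein.upper())]


def has_at_least(protein, minimum):
    """True if the sequence contains at least `minimum` motif copies."""
    return len(motif_positions(protein)) >= minimum


# Hypothetical polypeptide of three decapeptide motifs linked in tandem.
tandem = "AKPSYPPTYK" * 3
print(motif_positions(tandem))
print(has_at_least(tandem, 10))
```

Because the search is non-overlapping and scans left to right, tandemly linked decapeptides are counted once each, which suits checks on the number of motif copies in a polypeptide.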

In a preferred embodiment of the invention said adhesive polypeptide comprises the
amino acid sequence motif: A-K-P-S-Y-P-P-T-Y-K. Preferably said adhesive
polypeptide comprises at least one motif which has the amino acid sequence
A-K-P-S-Y-P-P-T-Y-K. Preferably said adhesive polypeptide consists of a plurality of
said motifs linked in tandem.

In a preferred embodiment of the invention said adhesive polypeptide comprises at
least 10 motifs, preferably greater than 10 motifs. More preferably still said adhesive
polypeptide comprises at least 50 motifs, preferably between 50 and 100 motifs,
more preferably between 60-85 motifs.

In a further preferred embodiment of the invention said hydroxylation is to an amino
acid at position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence,
X-K-X-X-Y-P-X-X-Y-K. More preferably still said hydroxylation is to amino acid
position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence,
A-K-P-S-Y-P-P-T-Y-K.

In a further preferred embodiment of the invention said adhesive polypeptide further
comprises a second motif wherein said second motif is represented by the amino acid
sequence: X-P-X-P-X-X-X-X-Y-K.

Preferably said second motif is selected from the group consisting of at least one of
P-P-K-P-T-P-P-T-Y-K; S-P-P-P-P-V-Y-K-Y-K; or S-P-P-P-P-T-P-V-Y-K, or
combinations thereof.

In a further preferred embodiment of the invention said nucleic acid molecule 
encodes a secretion signal, for example an expansin secretion signal. 

In a preferred embodiment of the invention said secretion signal comprises the amino
acid sequence: MAFSYSPFSSLFLLPFFFVFTFADYGGWQSGHATFYGGGDASGT,
or functional variant thereof.

In a yet further preferred embodiment of the invention said adhesive polypeptide
comprises an endoplasmic reticulum targeting sequence, preferably said sequence is
represented by the amino acid sequence HDEL, or functional variant thereof.

In a further preferred embodiment of the invention said second part encodes a spider
polypeptide, preferably a spider silk polypeptide.

In our co-pending application WO04/016651, we describe a new spider silk protein.
In addition to the tensile strength and elasticity of the silk, the additional
characteristics of super contraction, insolubility and non-allergenicity make the use of
silk in medical and industrial applications an attractive option. Medical applications
include artificial tendons/ligaments, wound-closure systems such as vascular wound
repair devices, haemostatic dressings, patches, glues and sutures. Industrial
applications include bullet-proof vests, light-weight body armour, cables, ropes and
parachute cords, even fishing line. A further advantage of spider silk over
synthetic alternatives such as Kevlar™ is that the latter are derived from
petrochemicals, which are processed under extreme, often polluting conditions and
add to the waste accumulating in landfills.

In a preferred embodiment of the invention said spider silk polypeptide is encoded by
a nucleic acid molecule comprising a nucleic acid sequence selected from the group
consisting of:

i) a nucleic acid sequence consisting of the sequence as represented in
Figure 4a or 4b;

ii) a nucleic acid sequence which encodes a polypeptide domain
comprising the amino acid sequence AGRGQGGYGQGAGG and at
least two motifs rich in polyalanine wherein said polyalanine motifs
comprise at least 6 alanine amino acid residues;

iii) a nucleic acid sequence which hybridises to the sequence presented in
(i) and (ii) above and which encodes a silk polypeptide; and

iv) a nucleic acid sequence that is degenerate as a result of the genetic
code to the nucleic acid sequence defined in (i) - (iii) above.
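The structural criteria in part (ii) can be expressed as a simple sequence test. The sketch below assumes that a "motif rich in polyalanine" can be approximated as an uninterrupted run of six or more alanine residues, which is a simplification rather than the definition used in this specification; the function names and the test sequence are hypothetical.

```python
import re

# Core sequence recited in part (ii) of the list above.
SILK_CORE = "AGRGQGGYGQGAGG"
# Assumed simplification: a polyalanine motif is a run of at least 6 alanines.
POLYALANINE = re.compile(r"A{6,}")


def polyalanine_runs(protein):
    """Return every run of six or more consecutive alanine residues."""
    return POLYALANINE.findall(protein.upper())


def matches_silk_domain(protein):
    """True if the core sequence and at least two polyalanine runs are present."""
    protein = protein.upper()
    return SILK_CORE in protein and len(polyalanine_runs(protein)) >= 2


# Hypothetical domain: two polyalanine runs flanking the core sequence.
domain = "GSS" + "A" * 7 + "GS" + SILK_CORE + "GSS" + "A" * 6
print([len(run) for run in polyalanine_runs(domain)])
print(matches_silk_domain(domain))
```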

In a preferred embodiment of the invention said domain comprises a motif
comprising a repetitive tripeptide sequence, GGX (glycine:glycine:X, where X is any
amino acid residue), and at least one SS (serine:serine)-polyalanine sequence.

In a yet further preferred embodiment of the invention said second part encodes a
polypeptide with glucosyltransferase activity.

Typically, GTases are enzymes that post-translationally transfer glucosyl residues
from an activated nucleotide sugar to monomeric and polymeric acceptor molecules
such as other sugars, proteins, lipids and other organic substrates. These glucosylated
molecules take part in diverse metabolic pathways and processes. The transfer of a
glucosyl moiety can alter the acceptor's bioactivity, solubility and transport
properties within the cell and throughout the plant. One family of GTases in higher
plants is defined by the presence of a C-terminal consensus sequence. The GTases of
this family function in the cytosol of plant cells and catalyse the transfer of glucose to
small molecular weight substrates, such as phenylpropanoid derivatives, coumarins,
flavonoids, other secondary metabolites and molecules known to act as plant
hormones. Other GTases transfer glucosyl residues to peptides and polypeptides.

In a preferred embodiment of the invention said glucosyltransferase is encoded by a
nucleic acid molecule selected from the group consisting of:

i) a nucleic acid molecule selected from the group consisting of nucleic acid
sequences as represented in Table 1;

ii) a nucleic acid molecule which hybridises to the sequences represented in (i)
above and which has glucosyltransferase activity; and

iii) nucleic acid molecules consisting of nucleic acid sequences which are
degenerate as a result of the genetic code to the sequences defined in (i) and
(ii) above.

In a preferred embodiment of the invention said nucleic acid molecules hybridise
under stringent hybridisation conditions to the nucleic acid molecules as represented
in Table 1.

The GTase sequences disclosed in Table 1 are cDNA sequences to which the oil
body targeting domain can be fused, typically as an in-frame translational fusion.
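An in-frame translational fusion requires that the two coding sequences are joined so that the reading frame is preserved and no stop codon interrupts the fused open reading frame. The following is a minimal sketch of such a check, assuming the standard genetic code; the helper names and the short toy sequences are hypothetical and do not correspond to Figure 3 or Table 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
STOP_CODONS = {codon for codon, aa in CODON_TABLE.items() if aa == "*"}


def split_codons(cds):
    """Split a coding sequence into codons, rejecting partial codons."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence must be a whole number of codons")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def fuse_in_frame(first_part, second_part):
    """Join two coding sequences so that translation reads through both."""
    first = split_codons(first_part)
    second = split_codons(second_part)
    if first and first[-1] in STOP_CODONS:
        first = first[:-1]  # drop the first part's own stop codon
    if any(codon in STOP_CODONS for codon in first + second[:-1]):
        raise ValueError("premature stop codon inside the fusion")
    return "".join(first + second)


def translate(cds):
    """Translate up to, but not including, the first stop codon."""
    protein = []
    for codon in split_codons(cds):
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Toy example: a two-codon "targeting" part fused to a two-codon "cargo" part.
fusion = fuse_in_frame("ATGGCTTAA", "AAACCGTAG")
print(fusion, translate(fusion))
```

The sketch fuses the parts directly; a real construct could also carry linker or cleavage site codons between them, provided these are likewise whole codons.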

In a still further preferred embodiment of the invention said nucleic acid encodes a
cleavage site to facilitate the purification of said second part from said first part.
Preferably said cleavage site is a proteolytic cleavage site.

According to a further aspect of the invention there is provided a vector comprising a 
nucleic acid molecule that encodes a chimeric polypeptide according to the invention. 

Preferably the nucleic acid in the vector is operably linked to an appropriate promoter
or other regulatory elements for transcription in a host cell such as a prokaryotic (e.g.
bacterial) or eukaryotic (e.g. fungal, plant, mammalian or insect) cell. The vector
may be a bi-functional expression vector which functions in multiple hosts. In the
example of nucleic acids encoding polypeptides according to the invention this may
contain its native promoter or other regulatory elements and in the case of cDNA this
may be under the control of an appropriate promoter or other regulatory elements for
expression in the host cell.

By "promoter" is meant a nucleotide sequence upstream from the transcriptional
initiation site and which contains all the regulatory regions required for transcription.
Suitable promoters include constitutive, tissue-specific, inducible, developmental or
other promoters for expression in plant cells comprised in plants depending on
design. Such promoters include viral, fungal, bacterial, animal and plant-derived
promoters capable of functioning in plant cells.

Constitutive promoters include, for example, the CaMV 35S promoter (Odell et al
(1985) Nature 313: 810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171);
ubiquitin (Christian et al (1989) Plant Mol. Biol. 18: 675-689); pEMU (Last et al
(1991) Theor. Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3:
2723-2730); ALS promoter (U.S. Application Serial No. 08/409,297), and the like.
Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144;
5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; and 5,608,142.

Chemical-regulated promoters can be used to modulate the expression of a gene in a
plant through the application of an exogenous chemical regulator. Depending upon
the objective, the promoter may be a chemical-inducible promoter, where application
of the chemical induces gene expression, or a chemical-repressible promoter, where
application of the chemical represses gene expression. Chemical-inducible
promoters are known in the art and include, but are not limited to, the maize In2-2
promoter, which is activated by benzenesulfonamide herbicide safeners, the maize
GST promoter, which is activated by hydrophobic electrophilic compounds that are
used as pre-emergent herbicides, and the tobacco PR-1a promoter, which is activated
by salicylic acid. Other chemical-regulated promoters of interest include
steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter in
Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellis et al.
(1998) Plant J. 14(2): 247-257) and tetracycline-inducible and
tetracycline-repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227:
229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by
reference).

Where enhanced expression in particular tissues is desired, tissue-specific promoters
can be utilised. Tissue-specific promoters include those described by Yamamoto et
al (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7):
792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al
(1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3):
1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascini et al
(1996) Plant Physiol. 112(2): 513-524; Yamamoto et al (1994) Plant Cell Physiol.
35(5): 773-778; Lam (1994) Results Probl. Cell Differ. 20: 181-196; Orozco et al
(1993) Plant Mol. Biol. 23(6): 1129-1138; Matsuoka et al (1993) Proc. Natl. Acad.
Sci. USA 90(20): 9586-9590; and Guevara-Garcia et al (1993) Plant J. 4(3): 495-50.

In a preferred embodiment of the invention said tissue specific promoter is a
promoter which is active during the accumulation of oil in developing oil seeds (for
example see Broun et al. (1998) Plant J. 13(2): 201-210).

"Operably linked" means joined as part of the same nucleic acid molecule, suitably
positioned and oriented for transcription to be initiated from the promoter. DNA
operably linked to a promoter is "under transcriptional initiation regulation" of the
promoter. In a preferred embodiment the promoter is an inducible promoter or a
developmentally regulated promoter.

Particular vectors are nucleic acid constructs which operate as plant vectors. Specific
procedures and vectors previously used with wide success upon plants are described
by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors.
In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific
Publishers, pp 121-148). Suitable vectors may include plant viral-derived vectors (see
e.g. EP-A-194809).

Vectors may also include selectable genetic markers such as those that confer
selectable phenotypes such as resistance to antibiotics or herbicides (e.g. kanamycin,
hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin,
imidazolinones and glyphosate).

Alternatively, or in addition, said vectors are vectors suitable for mammalian cell
transfection or yeast cell transfection. In the latter example multi-copy vectors such
as 2µ episomal vectors are preferred. Alternatively yeast CEN vectors and
integrating vectors such as YIP vectors are suitable for transformation of yeast
species such as Saccharomyces cerevisiae and Pichia spp.

According to a yet further aspect of the invention there is provided a method to
produce a chimeric polypeptide according to the invention comprising the steps of:

i) providing a cell according to the invention and growth conditions
conducive to the production of a chimeric polypeptide according to
the invention; and optionally

ii) purifying said chimeric polypeptide from said cell or growth media.

According to a further aspect of the invention there is provided a cell transfected or
transformed with at least one nucleic acid molecule or vector according to the
invention.

In a preferred embodiment of the invention said [...]

In a preferred embodiment of the invention said cell is a plant cell.

According to a further aspect of the invention there is provided a plant comprising a
cell according to the invention.

In a preferred embodiment of the invention said plant is selected from: corn (Zea
mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum),
alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum
bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum),
soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum),
peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ipomoea
batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera),
pineapple (Ananas comosus), citrus tree (Citrus spp.), cocoa (Theobroma cacao), tea
(Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus
carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea),
papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia
integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley,
vegetables and ornamentals.

Preferably, plants of the present invention are crop plants (for example, cereals and
pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea),
and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar
beet, maize, sunflower, soybean, sorghum, and flax (linseed). Horticultural plants to
which the present invention may be applied may include lettuce, endive, and
vegetable brassicas including cabbage, broccoli, and cauliflower. The present
invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato,
pepper. Grain plants that provide seeds of interest include oil-seed plants and
leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley,
rice, sorghum, rye, etc.

In a preferred embodiment of the invention said plant is an oil seed plant.

Oil seed plants include cotton, soybean, safflower, sunflower, Brassica, maize,
alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans
include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean,
lima bean, fava bean, lentils, chickpea, etc.

In a preferred embodiment of the invention said oil seed plant is safflower.

In a further preferred embodiment of the invention said cell is a prokaryotic cell. 

According to a further aspect of the invention there is provided a seed comprising a 
cell according to the invention. 

According to a further aspect of the invention there is provided a vaccine preparation
comprising a cell, or a fraction of a cell, according to the invention.

In a preferred embodiment of the invention said preparation is an oil body containing
fraction. Preferably said fraction includes a chimeric polypeptide according to the
invention associated with oil bodies in said fraction.

In a further preferred embodiment of the invention said preparation includes at least
one carrier.

The term carrier is construed in the following manner. A carrier is an immunogenic
molecule which, when bound to a second molecule, augments immune responses to
the latter. Some antigens are not intrinsically immunogenic (i.e. not immunogenic in
their own right) yet may be capable of generating antibody responses when associated
with a foreign protein molecule such as keyhole-limpet haemocyanin or tetanus
toxoid. Certain antigens which lack T-cell epitopes, such as polymers with a
repeating B-cell epitope, are intrinsically immunogenic to a limited extent. These are
known as T-independent antigens. Such antigens benefit from association with a
carrier such as tetanus toxoid, under which circumstance they elicit much stronger
antibody responses.

In a further preferred embodiment of the invention said preparation includes at least
one adjuvant.

An adjuvant is a substance or procedure which augments specific immune responses
to antigens by modulating the activity of immune cells. Examples of adjuvants
include, by way of example only, Freund's adjuvant, muramyl dipeptides and liposomes.
Adjuvants are distinct from carriers.

The vaccine preparations may conveniently be presented in unit dosage form and
may be prepared by any of the methods well-known in the art of pharmacy. In
general, the vaccine preparations are prepared by uniformly and intimately bringing
the active polypeptide into association with a liquid stabiliser, a finely divided solid
stabiliser, or both, and then, if necessary, shaping the product. Solid dosage forms
for oral administration include capsules, tablets, pills, powders and granules. In such
solid dosage forms, the active polypeptide is typically mixed with at least one inert,
pharmaceutically acceptable excipient such as sodium citrate or dicalcium phosphate
and/or one or more: a) fillers or extenders such as starches, lactose, sucrose, glucose,
mannitol and silicic acid; b) binders such as carboxymethylcellulose, alginates,
gelatin, polyvinylpyrrolidone, sucrose and acacia; c) humectants such as glycerol; d)
disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch,
alginic acid, certain silicates and sodium carbonate; e) solution retarding agents such
as paraffin; f) absorption accelerators such as quaternary ammonium compounds; g)
wetting agents such as cetyl alcohol and glycerol monostearate; h) absorbents such as
kaolin and bentonite clay; and i) lubricants such as talc, calcium stearate, magnesium
stearate, solid polyethylene glycols, sodium lauryl sulfate and mixtures thereof. In
the case of capsules, tablets and pills, the dosage form may also comprise buffering
agents. Solid compositions of a similar type may also be employed as fillers in soft
and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well
as high molecular weight polyethylene glycol, for example. The solid dosage forms
of tablets, dragees, capsules, pills, and granules can be prepared with coatings and
shells such as enteric coatings and other coatings well known in the pharmaceutical
formulating art. They may optionally contain opacifying agents and may also be of a
composition such that they release the active ingredient(s) only, or preferentially, in a
certain part of the intestinal tract, and/or in delayed fashion. Examples of embedding
compositions that can be used include polymeric substances and waxes.

According to a further aspect of the invention there is provided a method to
immunise an animal to at least the second part of the chimeric polypeptide according
to the invention, comprising administering an effective amount of a chimeric
polypeptide or preparation according to the invention sufficient to stimulate an
immune response to said second part.

In a preferred method of the invention said animal is human.

In an alternative preferred method of the invention said animal is selected from the
group consisting of: mouse; rat; hamster; goat; cow; horse; pig; dog; cat; or sheep.

In a further preferred method of the invention said immune response is the
production of antibodies to said chimeric polypeptide.

In an alternative preferred method of the invention said immune response is the
production of T-helper cells which recognise the second part of said chimeric
polypeptide.

A preferred route of administration is oral. However, intradermal, subcutaneous,
intramuscular or intranasal immunisation is within the scope of the invention.

According to a yet further aspect of the invention there is provided an antibody
obtainable by the method according to the invention.

In a preferred embodiment of the invention said antibody is a therapeutic antibody.

In a further preferred embodiment of the invention said antibody is a diagnostic
antibody. Preferably said diagnostic antibody is provided with a label or tag.

In a preferred embodiment of the invention said antibody is a monoclonal antibody or
binding fragment thereof. Preferably said antibody is a humanised or chimeric
antibody.

A chimeric antibody is produced by recombinant methods to contain the variable
region of an antibody with an invariant or constant region of a human antibody. A
humanised antibody is produced by recombinant methods to combine the
complementarity determining regions (CDRs) of an antibody with both the constant
(C) regions and the framework regions from the variable (V) regions of a human
antibody.

Chimeric antibodies are recombinant antibodies in which all of the V-regions of a
mouse or rat antibody are combined with human antibody C-regions. Humanised
antibodies are recombinant hybrid antibodies which fuse the complementarity
determining regions from a rodent antibody V-region with the framework regions
from the human antibody V-regions. The C-regions from the human antibody are also
used. The complementarity determining regions (CDRs) are the regions within the
N-terminal domain of both the heavy and light chain of the antibody to which the
majority of the variation of the V-region is restricted. These regions form loops at the
surface of the antibody molecule. These loops provide the binding surface between
the antibody and antigen.

Antibodies from non-human animals provoke an immune response to the foreign
antibody and its removal from the circulation. Both chimeric and humanised
antibodies have reduced antigenicity when injected into a human subject because there
is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid
antibody, while the human antibody regions do not elicit an immune response. This
results in a weaker immune response and a decrease in the clearance of the antibody.
This is clearly desirable when using therapeutic antibodies in the treatment of human
diseases. Humanised antibodies are designed to have fewer "foreign" antibody regions
and are therefore thought to be less immunogenic than chimeric antibodies.

It is also possible to create single variable regions, so called single chain antibody
variable region fragments (scFv's). If a hybridoma exists for a specific monoclonal
antibody it is well within the knowledge of the skilled person to isolate scFv's from
mRNA extracted from said hybridoma via RT-PCR. Alternatively, phage display
screening can be undertaken to identify clones expressing scFv's. Alternatively said
fragments are "domain antibody fragments". Domain antibodies are the smallest
binding part of an antibody (approximately 13 kDa). Examples of this technology are
disclosed in US6,248,516, US6,291,158, US6,127,197 and EP0368684, which are
all incorporated by reference in their entirety.

In a further preferred embodiment of the invention said antibodies are opsonic
antibodies.

Phagocytosis is mediated by macrophages and polymorphic leukocytes and involves
the ingestion and digestion of micro-organisms, damaged or dead cells, cell debris,
insoluble particles and activated clotting factors. Opsonins are agents which
facilitate the phagocytosis of the above foreign bodies. Opsonic antibodies are
therefore antibodies which provide the same function. Examples of opsonins are the
Fc portion of an antibody or complement C3. Antibodies raised by immunisation and
in the form of an immune complex with antigen may bring about opsonisation via the
fixation of complement on the antigen, or molecules in its immediate
microenvironment.

In a further aspect of the invention there is provided a method for preparing a
hybridoma cell-line producing monoclonal antibodies according to the invention
comprising the steps of:
i) immunising an immunocompetent mammal with a chimeric
polypeptide or preparation according to the invention;
ii) fusing lymphocytes of the immunised immunocompetent mammal
with myeloma cells to form hybridoma cells;
iii) screening monoclonal antibodies produced by the hybridoma cells of
step (ii) for binding activity to a polypeptide of the invention;
iv) culturing the hybridoma cells to proliferate and/or to secrete said
monoclonal antibody; and
v) recovering the monoclonal antibody from the culture supernatant.

Preferably, said immunocompetent mammal is a rodent, for example a mouse, rat or
hamster.

According to a further aspect of the invention there is provided a hybridoma cell-line 
obtainable by the method according to the invention. 

According to a further aspect of the invention there is provided a method to prepare a
vaccine preparation according to the invention comprising:
i) forming a cell extract from a plant cell culture or plant according to
the invention; and
ii) separating said extract into an oil body rich fraction.

In a preferred embodiment of the invention said extract is incubated with a protease
which cleaves said second part from said first part.

Throughout the description and claims of this specification, the word "comprise" and
variations of this word, for example "comprising" and "comprises", means
"including but not limited to", and is not intended to (and does not) exclude other
moieties, additives, components, integers or steps.

An embodiment of the invention will be described by example only and with
reference to the following figures:
Figure 1a is the nucleic acid sequence of OLB1; Figure 1b is the amino acid sequence
of OLB1;

Figure 2a is the nucleic acid sequence of OLB2; Figure 2b is the amino acid
sequence of OLB2;

Figure 3 is the amino acid sequence of the hydrophobic domain of OLB1;
Figure 4a and 4b is the nucleic acid sequence of a spider silk protein; and
Table 1 shows the nucleic acid sequences of plant GTase cDNAs.

Materials and Methods 

Materials - Castor beans (Ricinus communis var. gibsonii) were supplied by Chiltern
Seeds (Ulverston, Cumbria, UK). The beans were soaked in running water for 1 d
and germinated in the dark on moist vermiculite at 30°C. All reagents were obtained
from Sigma-Aldrich Co. (Gillingham, UK), except for glycerol tri[1-¹⁴C]oleate, which
was from Amersham Biosciences UK Ltd. (Little Chalfont, UK), and Flavobacterium
meningosepticum prolyl endopeptidase (PEP), which was from Seikagaku America
(Falmouth, USA).

Preparation of oil body membranes and proteolytic treatment - Oil bodies were
isolated from the endosperm of imbibed castor beans and the membranes delipidated
using the methods of Hills et al. (2). Peripheral proteins were removed from the oil
bodies using urea buffer, according to Millichip et al. (3). The membrane fraction
was solubilized in SDS-loading buffer, heated at 70°C for 10 min and the
polypeptides separated on a 14% SDS-PAGE gel as described by Laemmli (5) and
stained using 0.2% (w/v) Coomassie blue R-250 in methanol: acetic acid: water
(4:1:5 v/v/v). For proteolytic treatments, purified urea-washed oil bodies were
incubated in 0.1 M phosphate buffer (pH 7) containing 0.2 U ml⁻¹ of PEP for 1 h at
37°C. The oil bodies were recovered by centrifugation.
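
As a worked illustration of the staining solution described above, the following Python sketch computes nominal component amounts for a chosen batch volume from the 4:1:5 (v/v/v) methanol: acetic acid: water ratio and the 0.2% (w/v) dye concentration. The function name, the dictionary keys and the 100 ml batch size are illustrative assumptions and are not part of the original method; volumes are nominal and ignore any contraction on mixing.

```python
def stain_recipe(total_ml, ratio=(4, 1, 5), dye_percent_wv=0.2):
    """Return nominal component amounts for a Coomassie staining solution.

    ratio is methanol : acetic acid : water by volume (from the text);
    dye_percent_wv is grams of dye per 100 ml of solution (from the text).
    """
    parts = sum(ratio)
    methanol, acetic_acid, water = (total_ml * r / parts for r in ratio)
    dye_g = total_ml * dye_percent_wv / 100.0
    return {
        "methanol_ml": methanol,
        "acetic_acid_ml": acetic_acid,
        "water_ml": water,
        "coomassie_g": dye_g,
    }


if __name__ == "__main__":
    # Hypothetical batch size of 100 ml, chosen only for illustration.
    for name, amount in stain_recipe(100.0).items():
        print(f"{name}: {amount:.2f}")
```

Scaling the batch volume scales every component in proportion, so the same function can be reused for any gel tray size.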

Tryptic digestion and peptide sequence analysis - In-gel tryptic digestion was
performed by washing the gel pieces in Eppendorf tubes three times with 50% (v:v)
aqueous acetonitrile containing 25 mM ammonium bicarbonate and drying the gel
pieces in a vacuum concentrator for 30 min. Sequencing-grade modified porcine
trypsin (Promega) was dissolved in the 50 mM acetic acid supplied by the
manufacturer, then diluted 5-fold by adding 25 mM ammonium bicarbonate
containing 0.1% (v:v) octyl-β-D-glucopyranoside (Sigma) to give a final trypsin
concentration of 0.02 µg/µL. Gel pieces were rehydrated by adding 10 µL of trypsin
solution, and after 30 min enough trypsin solution was added to cover the gel.
Digests were incubated overnight at 37°C.

Tryptic digests were applied directly to the MALDI target plate. Dried peptides were
redissolved in 0.5 µL of formic acid followed by addition of 9.5 µL of water, and one
or more 0.5 µL aliquots were spotted and dried onto the MALDI target plate. A
matrix solution containing 10 mg/mL 4-hydroxy-α-cyano-cinnamic acid (Sigma) in
50% aqueous (v:v) acetonitrile containing 0.1% TFA (v:v) was freshly prepared.
Serial 2-fold dilutions of matrix solution were spotted onto the MALDI target plate,
and the dilution that gave a uniform layer of crystals when viewed in the mass
spectrometer was selected for use with samples. A 0.5 µL aliquot of matrix solution
was pipetted onto the dried peptide spot. In some cases, the matrix was added
directly to the sample droplet on the target plate before drying.

Positive-ion MALDI mass spectra were obtained using an Applied Biosystems 4700
Proteomics Analyzer (CTS version, Applied Biosystems, Foster City, CA, USA) in
reflectron mode with an accelerating voltage of 20 kV. MS spectra were acquired
with a total of 1000 laser pulses over a mass range of m/z 800-4000. Final mass
spectra were the summation of 20 sub-spectra, each acquired with 50 laser pulses,
and internally calibrated using the tryptic peptides at m/z 842.509 and 2211.104.
Monoisotopic masses were obtained from centroids of raw, unsmoothed data.
For CID-MS/MS, a Source 1 accelerating voltage of 8 kV, a collision energy of 1 kV,
and a Source 2 accelerating voltage of 15 kV were used. Air was used as the
collision gas at the instrument's 'medium' pressure setting with a recharge threshold
of 9.9 × 10⁻⁷ torr, which produced a Source 2 pressure of about 1 × 10⁻⁶ torr. The
precursor mass window was set to ±10 Da, and the metastable suppressor was
enabled. The default calibration was used for MS/MS spectra.

Mass spectral data obtained in batch mode were submitted to database searching
using a locally-running copy of the Mascot program (Matrix Science Ltd.,
version 1.7). Batch-acquired MS and MS/MS spectral data were submitted to a
combined peptide mass fingerprint and MS/MS ion search through the Applied
Biosystems GPS Explorer software interface (version 1.0) to Mascot. Search criteria
included: Maximum missed cleavages, 1; Variable modifications, Oxidation (M);
Peptide tolerance, 25 ppm; MS/MS tolerance, 0.2 Da. Peptide sequence tags were
generated from CID-MS/MS spectra by manual interpretation or using a de novo
sequencing program supplied by Applied Biosystems.
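
To make the 25 ppm peptide tolerance concrete, the following Python sketch converts a relative tolerance in ppm into an absolute window in Da at a given m/z and checks whether an observed value falls inside it. It is an illustration of the arithmetic only and does not reproduce how Mascot applies the tolerance internally; the function names and the observed m/z values in the example are hypothetical, while the theoretical values are the two calibrant masses quoted above.

```python
def ppm_to_da(mz, ppm):
    """Convert a relative tolerance in ppm to an absolute tolerance in Da at m/z."""
    return mz * ppm / 1e6


def within_tolerance(observed_mz, theoretical_mz, ppm):
    """Return True if the observed value lies within +/- ppm of the theoretical value."""
    return abs(observed_mz - theoretical_mz) <= ppm_to_da(theoretical_mz, ppm)


if __name__ == "__main__":
    tolerance_ppm = 25.0  # peptide tolerance stated in the search criteria
    for calibrant in (842.509, 2211.104):  # calibrant masses from the text
        window = ppm_to_da(calibrant, tolerance_ppm)
        print(f"m/z {calibrant}: +/- {window:.4f} Da")

    # Hypothetical observed values, chosen only to show both outcomes.
    print(within_tolerance(842.525, 842.509, tolerance_ppm))
    print(within_tolerance(842.540, 842.509, tolerance_ppm))
```

Because the window scales with m/z, the same 25 ppm setting admits a wider absolute error for the heavier calibrant than for the lighter one.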

RNA extraction, cDNA synthesis and PCR - Total RNA from various tissues was
isolated using the RNeasy kit from Qiagen. The synthesis of single stranded cDNA
was carried out using SuperScript™ II RNase H⁻ reverse transcriptase from
Invitrogen. Degenerate primers corresponding to peptide sequences 1 and 2 were
designed (5'-ttgatagtirtyagyttyaga and 5'-ctgtccraatgtrtaiarctt) and used to amplify a
fragment of the acid lipase (RcOBL1) cDNA from imbibed seed endosperm. Gene
specific primers (5'-gaccacttggtatgggcatatgatgg and 5'-catgtcattgcagtaaaccaccctga)
were then used to obtain the full-length cDNA sequence by 3'- and 5'-RACE using
the SMART™ RACE cDNA Amplification kit from BD Biosciences, following the
manufacturer's protocols. Primers to a castor actin-like gene (RcACT) (5'-
cgttctctccttgtatgccagtggtc and 5'-gagctgctcttggcagtctcaagttc) were used as a
constitutive control for RT-PCR experiments on various tissues.

A partial clone of RcOBL2 was obtained by performing PCR on cDNA from four-day
old endosperm using different combinations of four pairs of degenerate primers
designed using alignments of multiple OBL-like genes (5'-rtyagyttyagaggiachgarc or
5'-rtyagyttyagaggiachgayc or 5'-rtyagyttyagaggcachgarc or
5'-rtyagyttyagaggcachgayc with 5'-giaccatrtcrttrttrtabac or 5'-giaccatrtcrttrcartabac or
5'-giaciayrtcrttrttrtabac or 5'-giaciayrtcrttrcartabac). Gene specific primers (5'-
taggtctgggcaacagaagtgacgctac and 5'-tgcccaaatgtgtatatgttcagcaacc) were then used to
obtain the full-length RcOBL2 cDNA sequence by 3'- and 5'-RACE using the
SMART™ RACE cDNA Amplification kit from BD Biosciences, following the
manufacturer's protocols.
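
The degenerate primers above are written with IUPAC ambiguity codes (for example r = a or g, y = c or t, h = a, c or t, b = c, g or t). As a minimal sketch of what that notation implies, the following Python code counts how many distinct sequences a degenerate primer represents and, for small cases, lists them. The function names are hypothetical, and treating "i" (taken here to mean inosine) as a single universal base that contributes no degeneracy is an assumption rather than something stated in the text.

```python
from itertools import product

# IUPAC nucleotide codes in lower case, matching how the primers are written.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at", "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
    # Assumption: "i" denotes inosine and is kept as one universal base.
    "i": "i",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.lower():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    pools = [IUPAC[base] for base in primer.lower()]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    forward = "rtyagyttyagaggcachgarc"  # one forward primer from the text
    reverse = "giaccatrtcrttrttrtabac"  # one reverse primer from the text
    print(forward, degeneracy(forward))
    print(reverse, degeneracy(reverse))
    print(expand(forward)[:3])
```

Under these assumptions the forward primer shown has five two-fold positions and one three-fold position, giving 96 variants, and the reverse primer has four two-fold positions and one three-fold position, giving 48.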

ABBREVIATIONS

The abbreviations used are: TAG, triacylglycerol; CHAPS, 3-cholamidopropyl-
dimethylammonio-1-propanesulphonate; PEP, prolyl endopeptidase; TFA,
trifluoroacetic acid; MALDI, matrix-assisted laser desorption/ionization; MS, mass
spectrometry; CID, collision-induced dissociation; PCR, polymerase chain reaction;
RT-PCR, reverse transcriptase-PCR; RACE-PCR, rapid amplification of cDNA ends-
PCR; PAGE, polyacrylamide gel electrophoresis; MBP, maltose binding protein.
TABLE 1

>UGT71B1

ATGAAAGTAGAACTTGTGTTCATACCATCGCCGGGCGTTGGCCATATCCGAGCAACAACG 

GCGTTAGCAAAGCTTCTCGTTGCCAGCGACAACCGCCTCTCCGTCACTCTCATCGTCATT 

CCTTCACGAGTCTCCGACGACGCTTCTTCCTCCGTCTACACGAACTCCGAAGACCGTCTC 

CGCTACATCCTCCTCCCCGCCCGAGATCAAACTACTGATCTCGTATCTTACATCGACAGC 

CAGAAACCACAAGTAAGAGCCGTCGTGTCCAAGGTCGCTGGAGATGTTTCAACACGTTCA 

GACTCACGGCTAGCTGGGATTGTCGTAGACATGTTCTGCACGTCCATGATAGACATCGCC 

GATGAGTTTAACCTCTCGGCTTATATCTTCTACACGTCCAACGCTTCTTATCTCGGGCTA 

CAGTTCCACGTTCAATCTCTTTACGACGAGAAAGAACTCGACGTAAGTGAGTTCAAAGAT 

ACGGAGATGAAGTTTGACGTTCCAACTCTGACTCAGCCTTTTCCGGCAAAATGTTTGCCT 

TCAGTGATGCTAAACAAGAAATGGTTTCCTTACGTTTTGGGTCGAGCTAGAAGTTTTAGA 

GCAACGAAGGGTATTTTGGTAAATTCGGTGGCTGACATGGAACCTCAGGCGTTGAGTTTC 

TTTTCCGGTGGAAATGGGAATACAAATATCCCTCCGGTGTACGCGGTTGGGCCCATTATG 

GACTTAGAATCTAGCGGCGATGAAGAGAAGAGAAAGGAGATTTTACATTGGCTAAAAGAG

CAACCGACGAAATCTGTAGTGTTTCTCTGTTTTGGGAGCATGGGAGGTTTCAGTGAGGAA 

CAAGCAAGAGAAATAGCTGTGGCGCTCGAGCGAAGCGGACACAGGTTTCTCTGGTCGCTT 

CGCCGCGCTTCTCCTGTTGGAAACAAGTCTAATCCTCCTCCCGGAGAATTCACGAACTTA 

GAGGAGATTCTTCCAAAAGGGTTTTTAGATCGGACGGTGGAGATAGGGAAGATCATAAGC 

TGGGCACCACAAGTAGATGTGTTGAATAGTCCTGCTATAGGAGCGTTCGTGACACATTGT 

GGATGGAACTCAATTCTCGAGAGTCTTTGGTTCGGTGTTCCGATGGCGGCGTGGCCTATC 

TATGCTGAGCAACAGTTTAACGCGTTTCATATGGTGGATGAGCTTGGTTTAGCGGCGGAG 

GTAAAGAAGGAGTACCGTAGAGATTTTCTGGTGGAGGAGCCGGAGATTGTGACGGCTGAT 

GAGATAGAGAGAGGGATCAAGTGTGCGATGGAGCAGGATAGCAAGATGAGGAAGAGGGTG 

ATGGAGATGAAGGATAAGCTCCACGTGGCGTTGGTGGACGGTGGATCTTCGAACTGTGCT

CTAAAGAAGTTTGTTa\AGACGTGGTGGATAATGTTCCATAA 

>UGT71B2 

ATGAAACTGGAGCTGGTGTTCATACCATCACCTGGTGACGGACATCTCCGGCCATTAGTG 

GAGGTAGCTAAGCTTCATGTTGACCGTGACGACCATCTCTCCATCACCATCATCATCATC 
CCTCAGATGCATGGATTTAGTAGCAGTAACTCTTCTTCTTACATCGCTTCTCTCTCCTCT 
GATTCTGAAGAACGTCTTAGCTACAACGTTCTCTCCGTCCCTGATAAACCAGACTCCGAT 
GACACCAAACCACATTTTTTCGACTACATTGATAACTTCAAGCCGCAGGTCAAAGCCACG 
GTGGAAAAACTTACTGAGCCGGGTCCACCAGATTCGCCGTCGCGTCTTGCTGGATTCGTG 
GTGGATATGTTTTGCATGATGATGATTGATGTCGCTAATGAGTTTGGTGTTCCCAGTTAC 
ATGTTTTACACATCCAACGCAACGTTTCTTGGATTGCAAGTTCATGTTGAATACCTTTAC 
GACGTTAAGAACTATGACGTTAGTGACCTCAAGGACTCGGACACTACTGAGCTGGAAGTT 
CCTTGTTTGACTCGTCCTTTACCGGTTAAGTGTTTCCCCTCGGTTCTATTAACCAAGGAG 
TGGTTACCGGTTATGTTTAGACAAACCAGAAGATTCCGAGAAACTAAAGGTATTTTGGTA 
AATACATTCGCTGAGCTTGAGCCTCAAGCTATGAAGTTTTTCTCCGGCGTAGATAGTCCT 
CTGCCTACGGTGTACACAGTTGGACCGGTTATGAATCTTAAAATCAACGGTCCAAATTCA 
TCTGACGATAAGCAATCGGAGATCCTACGGTGGCTAGACGAGCAGCCACGTAAATCCGTT 
GTTTTCCTCTGTTTCGGAAGCATGGGAGGTTTCCGTGAGGGCCAAGCTAAAGAAATCGCA 
ATCGCGCTTGAGCGAAGTGGTCACCGCTTTGTCTGGTCTCTTCGTCGTGCTCAACCAAAA 
GGATCGATAGGACCTCCCGAAGAATTTACGAATCTTGAGGAAATTCTCCCGGAAGGATTC 
TTGGAACGGACGGCAGAGATAGGAAAGATTGTAGGTTGGGCTCCACAAAGCGCCATTCTA 
GCAAATCCTGCGATCGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGACGCTAGAGAGT 
CTATGGTTCGGAGTTCCGATGGCTACGTGGCCGCTTTACGCAGAGCAACAAGTTAACGCG 
TTCGAGATGGTTGAGGAGCTAGGGCTAGCGGTGGAGGTCCGAAATAGTTTCCGAGGAGAT 
TTCATGGCGGCGGATGATGAGTTGATGACGGCAGAGGAGATAGAGAGAGGGATCCGGTGT 
TTGATGGAGCAGGATAGTGACGTGAGGAGTAGAGTGAAGGAGATGAGCGAGAAGAGTCAG 
GTAGCTTTAATGGACGGTGGATCTTCGCACGTTGCTCTTCTAAAGTTTATTCAAGACGTC 

ACTAAGAATATCTCTTGA

>UGT71B5
ATGAAGATTGAGCTTGTGTTCATACCTTTGCCGGGGATTGGTCATCTCAGGCCAACCGTG
AAGCTAGCGAAGCAACTCATAGGCAGCGAAAACCGTCTTTCGATCACCATAATCATCATC 
CCTTCAAGATTTGACGCCGGTGATGCATCCGCCTGTATCGCATCTCTCaVCCACGTTGTCT 

CAAGATGATCGCCTCCATTACGAATCCATATCCGTCGCaiAAACAACCA 

CCGGATCCTGTTCCGGCTCAAGTGTACATAGAGAAACAAAAGACGAAAGTGAGAGATGCA 

GTCGCGGCGAGAATCGTCGATCCAACAAGAAAGCTCGCGGGATTCGTGGTGGACATGTTC 

TGTTCCTCGATGATCGATGTAGCTAACGAGTTTGGAGTTCCGTGTTATATGGTATACACA 

TCGAACGCTACGTTTTTAGGAACCATGCTTCACGTTaU^CAAATGTACG 

TATGACGTCAGCGAGTTAGAAAACTCGGTCACCGAGTTGGAGTTTCCGTCTCTGACTCGT 

CCTTATCCAGTGAAGTGTCTTCCTCATATCCTCACTTCAAAGGAGTGGTTACCTCTCTCT 

CTAGCTCAAGCTAGGTGTTTCCGGAAGATGAAGGGTATTTTGGTAAATACAGTTGCTGAG 

CTTGAACCTCACGCTTTGAAAATGTTCAATATTAATGGTGACGATCTTCCTCAAGTTTAT 

CCTGTTGGACCAGTGTTGCATCTCGAAAACGGCAATGACGATGATGAGAAGCAATCGGAA 

ATTTTGCGGTGGCTCGACGAGCAACCGTCTAAATCTGTTGTGTTTCTCTGCTTTGGGAGC

TTGGGAGGTTTCACTGAAGAACT^CAAGAGAAACCGCTGTGGCCCTAGATAG 

O^GCGGTTTCTTTGGTGTGTTCGTCACGCATCGCCAAATATAAAAACAGATCGTCCCAGA 

GATTACACGAATCTTGAGGAGGTTTTACCGGAGGGGTTCTTGGAACGGACTTTGGATAGA 

GGGAAAGTGATTGGATGGGCACCACAAGTGGCGGTACTAGAGAAGCCGGCGATAGGAGGG 

TTTGTCACTCACTGCGGTTGGAACTCTATTTTAGAGAGCTTGTGGTTCGGTGTTCCAATG 

GTGACGTGGCCGCTATACGCGGAACAGAAGGTTAACGCGTTTGAGATGGTTGAGGAGCTG 

GGTTTGGCGGTGGAGATACGGAAGTACTTAAAAGGAGATTTGTTCGCCGGAGAGATGGAG 

ACGGTTACCGCGGAGGATATAGAGAGAGCCATTAGGCGTGTGATGGAGCAAGACAGTGAC 

GTTAGGAACAACGTGAAAGAGATGGCGGAGAAGTGCCACTTCGCGTTAATGGACGGTGGA 

TCTTCGAAGGC(3GCTTTGGAAAAGTTTATTCAAQACGTGATAGAGAATATGGATTAA 

>UGT71B6

ATGAAAATAGAGCTAGTATTCATTCCCTCTCCGGCAATTAGTCATCTCATGGCGACGGTA 
GAGATGGCGGAGCAACTAGTTGATAAAAACGACAACCTCTCTATCACCGTAATCATCATA 
TCTTTTAGTTCTAAAAATACATCCATGATCACCTCTCTTACATCCAACAACCGCCTCCGG 
TACGAAATAATCTCCGGAGGAGATCAACAACCAACGGAGCTCAAAGCAACTGATTCCCAC 
ATCCAAAGTCTAAAGCCACTGGTGAGAGACGCGGTTGCTAAACTCGTAGATTCCACTCTA 
CCAGACGCGCCTCGTCTTGCGGGATTCGTTGTTGACATGTACTGCACGTCGATGATCGAT 
GTCGCTAACGAATTTGGCGTCCCTAGTTACTTGTTTTACACCTCTAACGCTGGATTTCTT 
GGACTTTTGCTTCACATTCAGTTCATGTACGATGCAGAGGATATCTATGACATGAGCGAA 
TTAGAAGACTCTGACGTAGAGTTGGTGGTTCCGAGTTTGACTAGTCCTTATCCGTTGAAA 
TGTCTTCCTTACATTTTCAAATCAAAAGAGTGGCTCACTTTTTTTGTAACTCAAGCGAGA 
AGATTCAGAGAAACTAAGGGCATTTTGGTAAACACGGTTCCTGACTTGGAACCTCAAGCG 
TTGACGTTTCTTTCCAATGGTAACATTCCACGTGCTTACCCAGTAGGACCATTGTTGCAT 
CTCAAAAACGTAiUlTTGTGATTACGTGGACAAGAAGCAATCGGAGATTTTACGGTGGCTA 
GACGAGCAACCGCCAAGATCTGTAGTGTTCCTCTGTTTCGGGAGCATGGGAGGGTTCAGT 
GAGGAACAAGTGAGAGAAACCGCATTAGCTCTCGATCGAAGCGGCCACCGGTTTCTTTGG 
TCTCTCCGTCGTGCATCTCCGAATATATTGAGAGAGCCTCCCGGAGAATTCACAAACCTA 
GAGGAGATTCTCCCAGAAGGGTTTTTCGATCGGACGGCTAACAGAGGAAAGGTTATCGGA 
TGGGCTGAACAGGTGGCCATATTGGCGAAGCCGGCGATCGGAGGTTTTGTTTCTCACGGC 
GGATGGAATTCGACGTTGGAGAGTTTGTGGTTTGGTGTTCCGATGGCGATTTGGCCGCTT 
TACGCTGAACAGAAGTTTAACGCTTTCGAGATGGTGGAAGAGCTTGGTTTGGCTGTGGAG 
ATCAAGAAGCATTGGCGAGGAGATCTTTTGTTGGGGAGGTCGGAGATTGTGACGGCGGAG
GAGATTGAGAAAGGAATCATATGTTTGATGGAGCAAGACAGTGACGTCAGGAAGAGAGTG 
AATGAGATCAGCGAGAAGTGCCAGGTGGCTTTAATGGACGGTGGATCGTCAGAAACTGCT 
TTGAAAAGATTTATTCAAGACGTAACGGAGAATATTGCTTGGTCGGAAACTGAAAGCTAG 
>UGT71B7 

ATGAAATTTGAGCTTGTTTTCATCCCCTATCCCGGAATCGGTCATCTCCGATCAACGGTA 
GAAATGGCAAAGCTACrAGTGGACCGTGAAACTCGTCTCTCTATCTCCGTTATCATCCTT 
CCTTTCATTTCCGAAGGCGAAGTCGGTGCTTCCGATTACATCGCAGCCCTCTCCGCCTCA

TCCAACAACCGCCTCCGCTACGAAGTTATCTCCGCCGTAGATCAACCAACCATCGAGATG 

ACGACAATTGAAATCCATATGAAGAACCAAGAACCAAAGGTGAGAAGCACCGTTGCAAAA 

CTCCTTGAAGACTATTCGTCTAAACCGGACTCGCCGAAGATCGCTGGCTTTGTTCTAGAC 

ATGTTTTGCACTTCGATGGTAGATGTAGCGAACGAGTTTGGTTTCCCGAGTTATATGTTT 

TACACCTCCAGTGCCGGGATTCTCTCAGTTACATATCATGTTCAAATGTTGTGCGATGAG 

AACAAGTACGATGTTAGTGAAAATGATTATGCAGACTCGGAAGCTGTGTTGAACTTTCCG 

AGTTTGAGTCGTCCTTATCCGGTGAAGTGTCTTCCTCACGCTCTGGCAGCTAATATGTGG 

CTCCCGGTGTTTGTAAACCAAGCGAGAAAGTTTAGGGAGATGAAAGGTATTTTGGTAAAT 

ACTGTTGCTGAGCTTGAACCTTATGTGTTAAAGTTTCTTTCTAGTAGTGATACTCCTCCT

GTTTATCCTGTTGGACCATTGTTGCATCTTGAGAACCAACX5TGATGATTCTAAGGACGAG 

AAACGGTTGGAGATTATACGGTGGTTGGATCAGCAACCACCAAGTTCGGTTGTGTTTCTC 

TGCTTTGGGAGCATGGGAGGCTTCGGTGAGGAACAAGTAAGAGAGATCGCAATCGCGTTA 

GAGCGAAGTGGGCACCGGTTTCTCTGGTCTCTTCGTCGCGCATCTCCGAATATATTCAAA

GAACTTCCAGGAGAGTTTACTAATCTAGAGGAAGTTCTCCCGGAAGGATTCTTTGATCGA 

ACGAAAGATATAGGTAAAGTGATTGGATGGGCTCCACAAGTAGCCGTTCTTGCGAATCCG 

GCTATAGGAGGTTTCGTAACTCATTGCGGGTGGAATTCTACGCTAGAGAGTCTTTGGTTT

GGTGTTCCAACAGCTGCATGGCCGTTATACGCAGAGCAGAAGTTCAATGCTTTCTTAATG 

GTGGAGGAGCTTGGATTGGCGGTGGAGATAAGGAAGTATTGGCGAGGTGAACATTTGGCG 

GGATTACCGACGGCTACTGTGACAGCGGAGGAGATAGAGAAAGCAATCATGTGTCTAATG 

GAACAAGATAGTGACGTGAGGAAAAGAGTGAAGGATATGAGCGAGAAATGCCATGTGGCT 

TTAATGGATGGTGGATCGTCGCGTACTGCGTTGGAAAAGTTTATTGAAGAGGTTGCGAAG 

AATATAGTTTCACTAGATAAGGAATTTGAGCATGTAGCTCTTAAATGA 

>UGT71B8

ATGAACAAATTTGCGCTTGTCTTCGTACCATTTCCTATACTTGGTCATCTCAAATCAACC 

GCCGAGATGGCTAAGCTACTAGTGGAGCAAGAAACTCGCCTCTCTATCTCCATTATCATC 

CTTCCTCTTCTTTCCGGAGACGACGTCAGTGCTTCCGCTTATATCTCAGCTCTTTCCGCC 

GCATCCAACGACCGCCTTCACTATGAAGTGATCTCGGACGGAGATCAACCAACCGTCGGG 

TTACATGTCGATAACCACATCCCGATGGTGAAACGTACCGTTGCAAAACTCGTTGATGAC 

TACTCAAGGCGGCCGGACTCGCCGAGGCTCGCTdGTTTAGTTGTTGACATGTTTTGTATC 

TCGGTGATAGACGTGGCTAATGAGGTTAGTGTTCCGTGTTACTTGTTTTACACGTCAAAC 

GTTGGGATTCTTGCTCTTGGGTTACATATTCAGATGTTGTTTGATAAGAAGGAGTACAGT 

GTCAGTGAAACTGATTTTGAAGACTCGGAAGTTGTGTTGGATGTTCCGAGTTTGACTTGT 

CCTTATCCGGTGAAGTGTCTTCCTTATGGTTTGGCAACGAAAGAGTGGCTTCCTATGTAT 

CTAAATCAAGGTAGAAGATTCAGAGAGATGAAAGGTATTTTGGTAAATACTTTTGCTGAG 

CTTGAACCTTATGCGTTGGAGTCTCTTCACTCTAGTGGTGATACTCCTCGTGCTTATCCA 

GTGGGACCATTGTTGCATCTCGAGAACCATGTTGACGGTTCTAAAGACGAGAAGGGTTCG 

GACATTTTACGGTGGTTAGATGAACAACCACCTAAATCGGTAGTGTTCCTCTGCTTTGGA 

AGCATAGGAGGCTTTAACGAGGAACAAGCAAGAGAAATGGCCATTGCACTTGAGAGAAGT 

GGTCACCGCTTCTTGTGGTCTCTTCGCCGTGCATCTCGAGATATAGATAAGGAACTTCCC 

GGAGAATTCAAGAATCTTGAAGAAATTCTCCCGGAAGGATTCTTTGATCGGACAAAGGAT 

AAAGGAAAGGTGATCGGATGGGCTCCACAAGTAGCCGTGCTGGCTAAGCCAGCAATCGGA 

GGTTTTGTTACTCATTGCGGGTGGAACTCGATACTCGAGAGTCTTTGGTTCGGTGTTCCT 

ATAGCGCGATGGCCGTTATACGCTGAGCAGAAGTTTAATGCTTTCGTGATGGTGGAGGAG 

CTTGGTTTGGCAGTGAAGATAAGAAAGTATTGGCGAGGCGATCAGTTGGTGGGAACGGCG 

ACGGTCATAGTGACGGCAGAGGAGATAGAGAQAGGAATCAGATGTTTGATGGAGCAAGAT 

AGTGACGTGAGGAATAGAGTGAAGGAGATGAGTAAGAAATGTCACATGGCTTTAAAGGAT 

GGTGGCTCGTCTCAATCTGCTTTGAAATTATTTATTCAAGACGTTACGAAGTATATTGCT 

TGA

>UGT71C1

ATGGGGAAGCAAGAAGATGCAGAGCTCGTCATCATACCTTTCCCTTTCTCCGGACACATT 

CTCGCAACAATCGAACTCGCCAAACGTCTCATAAGTCAAGACAATCCTCGGATCCACACC 
ATCACCATCCTCTATTGGGGATTACCTTTTATTCCTCaAGCTGACACaATCGCTTTCCTC

CGATCCCTAGTCAAAAATGAGCCTCGTATCCGTCTCGTTACGTTGCCCGAAGTCCAAGAC 

CCTCCACCAATGGAACTCTTTGTGGAATTTGCCGAATCTTACATTCTTGAATACGTCAAG 

AAAATGGTTCCCATCATCAGAGAAGCTCTCTCCACTCTCTTGTCTTCCCGCGATGAATCG 

GGTTCAGTTCGTGTGGCTGGATTGGTTCTTGACTTCTTCTGCGTCCCTATGATCGATGTA 

GGAAACGAGTTTAATCTCCCTTCTTACATTTTCTTGACGTGTAGCGCAGGGTTCTTGGGT 

ATGATGAAGTATCTTCCAGAGAGACACCGCGAAATCAAATCGGAATTCAACCGGAGCTTC 

AACGAGGAGTTGAATCTCATTCCTGGTTATGTCAACTCTGTTCCTACTAAGGTTTTGCCG 

TCAGGTCTATTCATGAAAGAGACCTACGAGCCTTGGGTCGAACTAGCAGAGAGGTTTCCT 

GAAGCTAAGGGTATTTTGGTTAATTCATACACAGCTCTCGAGCCAAACGGTTTTAAATAT 

TTCGATCGTTGTCCGGATAACTACCCAACCATTTACCOATCGGGCCGATATTATGCTC 

AACGACCGTCCGAATTTGGACTCATCGGAACGAGATCGGATCATAACTTGGCTAGATGAC 

CAACCCGAGTCATCGGTCGTGTTCCTCTGTTTCGGGAGCTTGAAGAATCTCAGCGCTACT 

CAGATCAACGAGATAGCTCAAGCCTTAGAGATCGTTGACTGCAAATTCATCTGGTCGTTT

CGAACCAACCCGAAGGAGTACGCGAGCCCTTACGAGGCTCTACCACACGGGTTCATGGAC 

CGGGTCATGGATCAAGGCATTGTTTGTGGTTGGGCTCCTCAAGTTGAAATCCTAGCCCAT 

AAAGCTGTGGGAGGATTCGTATCTCATTGTGGTTGGAACTCGATATTGGAGAGTTTGGGT 

TTCGGCGTTCCAATCGCCACGTGGCCGATGTACGCGGAACAACTyVCTAAACGCGTTCACG 

ATGGTGAAGGAGCTTGGTTTAGCCTTGGAGATGCGGTTGGATTACGTGTCGGAAGATGGA 

GATATAGTGAAAGCTGATGAGATCGCAGGAACCGTTAGATCTTTAATGGACGGTGTGGAT 

GTGCCGAAGAGTAAAGTGAAGGAGATTGCTGAGGCGGGAAAAGAAGCTGTGGACGGTGGA 

TCTTCGTTTCTTGCGGTTAAAAGATTCATCGGTGACTTGATCGACGGCGTTTCTATAAGT 

AAGTAG

>UGT71C2

ATGGCGAAGCAGCAAGAAGCAGAGCTCATCTTCATCCCATTTCCAATCCCCGGACACATT 

CTCGCCACAATCGAACTCGCGAAACGTCTCATCAGTCACC^CCTAGTCGGATCCACACC 

ATCACCATCCTCCATTGGAGCTTACCTTTTCTTCCTCAATCTGACACTATCGCCTTCCTC 

AAATCCCTAATCGAAACAGAGTCTCGTATCCGTCTCATTACCTTACCCGATGTCCAAAAC 

CCTCCACCAATGGAGCTATTTGTGAAAGCTTCCGAATCTTACATTGTTGAATACGTCAAG 

AAAATGGTTCCTTTGGTCAGAAACGCTCTCTCCACTCTCTTGTCTTCTCGTGATG^ 

GATTCAGTTCATGTCGCCiSGATTAGTTCTTGATTTCTTCTGTGTCCCTTTGATCGATGTC 

GGAAACGAGTTTAATCTCCCTTCTTACATCTTCTTGACGTGTAGCGCAAGTTTCTTGGGT 

ATGATGAAGTATCTTCTGGAGAGAAACCGCGAAACCAAACCGGAACTTAAqCGGAGCTCT 

GACGAGGAAACAATATCAGTTCCTGGTTTTGTTAACTCCGTTCCGGTTAAAGTTTTGCCA 

CCGGGTTTGTTCACGACTGAGTCTTACGAAGCTTGGGTCGAAATGGCGGAAAGGTTCCCT 

GAAGCCAAGGGTATTTTGGTCAATTCATTTGAATCTCTAGMCGTAACGCTTTT6ATTAT 

TTCGATCGTCGTCCGGATAATTACCCACCCGTTTACCCAATCGGGCCAATTCTATGCTCC 

AACGATCiGTCCGAATTTGGATTTATCGGAACGAGACCGGATCTTGAAATGGCTCGATGAC 

CAACCCGAGTCATCTGTTGTGTTTCTCTGCTTCGGGAGCTTGAAGAGTCTCGCTGCGTCT 

O^GATTAAAGAGATCGCTCS^GCCTTAGAGCTCGTCGGAATd^^ 

CGAACGGACCCGAAGGAGTACGCGAGCCCGAACGAGATTTTACCGGACGGGTTTATGAA^ 
;CGAGTCATGGGTTTGGGCCTTGTTTGTGGTTGGGCTCCTCAAGTTGAAATTCTGGCCCAT 
AAAGCAATTGGAGGGTTCGTGTCACACTGCGGTTGGiV^CTCGATATTGGAGAGTTTGCGT 
TTCGGAiSTTCCAATTGCCACGTGGCCAATGTACGCGGAACAACAACTAAACGCGTTCACG 
ATTGTGAAGGAGCTTGGTTTGGCGTTGGAGATGCGGTTGGATTACGTGTCGGAATATGGA 
GAAATCGTGAT^GCTGATGAAATCGCAGGAGCCGTACGATCTTTGATGGACGGTGAGGAT 
GTGCCGAGGAGGAAACTGAAGGAGATTGCGGAGGCGGGAAAAGAGGCTGTGATGGACGGT 
GGATCTTCGTTTGTTGCGGTTAAAAGATTCATAGATGGGCTTTGA 

>UGT71C3

ATGAAAGCAGAAGCAGAGATCATCTTCGTTACATATCCATCCCCTGGTCATCTTCTTGTC 
TCCATTGAATTCGCTAAATCTCTCATCAAACGTGATGATCGCATCCACACCATCACCATC 
CTCTACTGGGCTTTACC'TCTCGCTCCTCAAGCCCACCTTTTCGCTAAGTCCCTCGTTGCT 
TCACAGCCTCGAATCCGTCTCCTTGCGTTGCCTGATGTTCAAAACCCTCCACCATTGGAA
CTCTTCTTTAAAGCTCCCGAAGCTTATATTCTTGAGTCCACCAAGAAAACAGTTCCTTTA 
GTCAGAGACGCTCTCTCCACTCTAGTTTCTTCACGTAAAGAATCCGGTTCGGTTCGTGTA 
GTCGGTTTGGTTATCGATTTTTTTTGTGTTCCAATGATCGAAGTGGCAAACGAGCTTAAC
CTTCCTTCTTACATCTTCCTAACGTGTAACGCTGGGTTTTTAAGTATGATGAAGTATCTC 
CCTGAGAGACATCGCATAACCACTTCTGAGCTAGATTTAAGCTCCGGCAACGTAGAACAT 
CCAATTCCTGGCTACGTCTGCTCCGTGCCGACGAAGGTTTTGCCTCCAGGTCTATTCGTG 
AGAGAGTCCTACGAGGCTTGGGTCGAGATTGCAGAGAAGTTCCCTGGAGCCAAGGGCATT
TTGGTAAACTCAGTCACATGTCTTGAGCAGAATGCATTTGATTACTTCGCTCGTCTTGAT 
GAGAACTATCCTCCGGTTTACCCGGTCGGACCGGTTCTTAGTTTGAAGGATCGTCCGTCT 
CCAAATCT'GGACGCATCX3GACCGGGATCGGATCATGAGATGGCTCGAGGACCAGCCGGAG 

tcgtcaattgtgtatatctgcttcggaagcctcggaatcattggcaagctgcagattgaa 
gagatagctgaagccttggaactcaccggccacaggtttctttggtcaatacgtacaaat 

CCGACGGAGAAAGCGAGCCCGTACGATCTGTTGCCGGAGGGATTTCTCGATCGGACGGCC 

agtaagggattggtgtgtgattgggccccgcaagtagaagttctggcccataaagcgctc 
ggaggattcgtgtctcactgcggttggaactctgtactggagagcttatggttcggtgtt 
ccgatcgccacgtggccaatgtacgctgagcaacagttaaacgcattctcgatggtgaag
gagttagggttagccgtggagctgcgtttagactacgtttcggcgtacggagagatagta 
aaagctgaggagatcgcgggagccatacgatcattgatggacggtgaggatacgccgagg 

AAGAGAGTGAAGGAGATGGCGGAAGCGGCGAGGAATGCTTTGATGGACGGAGGATCTTCG 

tttgttgcggttaaacgatttctcgacgagttgatcgqcggagatgtttag 



>UGT71C4

atggtgaaggaaacagagctaatcttcattccagttccatccacaggtcatattctcgtc 
catattgaattcgccaagcgtctcatcaatctcgaccatcggatccacaccatcactatt 
ctcaacttatcctcaccctcttctcctcacgcctccgtcttcgccagatctctcatcgct 
tcccagcccaaaatccgtctccacgaccttccccctatccaagatcctcctccattcgat
ctttaccaaagagctcccgaagcttacatagtaaaactcatcaagaaaaatactcctctg 
ataaaagacgccgtctccagcatcgtcgcgtcgcgtcgtggaggctcagattcggttcaa 
gtcgccggtttggttctcgatttattctgcaattcattggtaaaagatgttggcaacgag 
cttaatcttccttcttacatataccttacgtgtaacgctagatacttggggatgatgaaa 
tatattccggatcggcatcggaaaatcgcatctgagttcgatttgagctccggcgatgaa 
gaattgccggttccgggattcataaacgctattccgacgaaatttatgccgcctggattg 
ttcaataaggaagcttacgaggcttacgtagagctagcgccgagattcgcagatgcgaag 
ggtattttggttaattccttcacggagcttgagccgcacccgtttgactatttctctcac 
ctggagaaattccctccggtttacccggtcggaccgattctcagcttgaaagatcgagcg 
agtccgaacgaagaagcagtcgatcgggatcagatcgttgggtggctcgatgatcagccg 
gagtcatcggtggtgttcctctgtttcgggagcagaggaagcgttgatgagccgcaagtg 
aaggagatagctcgagctttggaactcgtcggctgcagatttctttggtcaattagaaca 
'agcggcgacgtcgagacgaatcctaacx3Atgtgttgccggaggggttcatgggccgagta 
gcaggccgaggtttggtatgtggttgggctccacaagtggaagtgttggcccataaagca 
ataggaggatttgtgtctcactgtggttggaactccacgcttgaaagcttatggttcggg 
gttcctgtcgcaacgtggccgatgtacgcagagcaacagcttaacgccttcacgctggtg 
aaagagcttgggcttgcggtggacctgcggatggattacgtgtcgagtcgtgggggtttg 
gtgacttgtgatgagatagccagagccgtacgatctttgatggacggtggagatgagaag 
agaaaaaaggttaaggagatggctgatgcggcaaggaaggctttgatggatggaggatcg 
tcttctttggcaactgctcgattcatcga^gaattgtttgaagatggttcgtcgtgctaa 



>UGT71C5

ATGAAGACAGCAGAGCTCATATTCGTTCCTCTGCCGGAGACCGGCCATCTCTTGTCAACG 

ATCGAGTTTGGAAAGCGTCTACTCAATCTAGACCGTCGGATTTCTATGATTACAATCCTC 

TCCATGAATCTTCCTTACGCTCCTCACGCCGACGCTTCTCTTGCTTCGCTAACAGCCTCC 
GAGCCTGGTATCCGAATCATCAGTCTCCCGGAGATCCACGATCCACCTCCGATCAAGCTT

CTTGACACTTCCTCCGAGACTTACATCCTCGATTTCATCCATAAAAACATACCTTGTCTC 

AGAAAAACCATCCAAGATTTAGTCTGA.TCATCATCATCTTCCGGAGGTGGTAGTAGTC^ 

GTCGCCGGCTTGATTCTTGATTTCTTGTGCGTTGGTTTGATCGACATCGGCCGTGAGGTA

AACCTTCCTTCCTATATCTTCATGACTTCCAACTTTGGTTTCTTAGGGGTTCTACAGTAT 

CTCCCGGAACGACAACGTTTGACTCCGTCGGAGTTCGATGAGAGCTCCGGCGAGGAAGAG 

TTACATATTCCGGCGTTTGTGAACCGTGTTCCCGCCAAGGTTCTGCCGCCAGGTGTGTTC 

GATAAACTCTCTTACGGGTCTCTGGTCa^AAATCGGCGAGCGATTACATGAAGCCAAGGGT 

ATTTTGGTTAATTCATTTACCCAAGTGGAGCCTTATGCTGCTGAACATTTTTCTCAAGt^ 

CGAGATTACCCTCACGTGTATCCTGTTGGGCCGGTTCTCAACTTAACGGGCCGTACAAAT 

CCGGGTCTAGCTTCGGCCCAATATAAAGAGATGATGAAGTGGCTTGACGAGCa^ 

TCGTCGGTTTTGTTCCTGTGTTTCGGGAGCATGGGAGTCTTCCCTGCACCTCAGATCACA 

GAGATTGCTCACGCGCTCGAGCTTATCGGGTGCAGGTTCATCTGGGCGATCCGTACGAAC 

ATGGCGGGAGATGGCGATCCTCAGGAGCCGCTTCCAGAAGGATTTGTCGATCGAACAATG 

GGCCGTGGAATTGTGTGTAGTTGGGCTCCACAAGTGGATATCTTGGCCCACAAGGCAACA 

GGTGGATTCGTTTCTCACTGCGGGTGGAATTCCGTCOy^GAGAGTCTATGGTACGGTGTA 

CCTATTGCAACGTGGCCAATGTATGCGGAGCAACAACTGAACGCATTTGAGATGGTGAAG 

GAGTTGGGCTTAGCAGTGGAGATAAGGCTTGACTACGTGGCGGATGGTGATAGGGTTACT 

TTGGAGATCGTGTCAGCCGATGAAATAGCCACAGCCGTCCGATCATTGATGGATAGTGAT 

AACCCCGTGAGAAAGAAGGTTATAGAAAAATCTTCAGTGGCGAGGAAAGCTGTTGGTGAT 

GGTGGGTCTTCTACGGTGGCCACATGTAATTTTATCAAAGATATTCTTGGGGATCACTTT 

TGA



>UGT71D1 

ATGCGGAATGTAGAGCTCATCTTCATCCCCACACCAACCGTTGGTCATCTTGTTCCGTTT 
CTTGAATTTGCTAGGCGTCTCATTGAGCAAGATGATAGGATCCGTATCACAATCCTCTTG 
ATGAAACTACAAGGTCAGTCTCATCTAGACACTTATGTTAAATCAATTGCCTCCTCTCAA 
CCGTTTGTTAGATTCATTGATGTCCCTGAGTTAGAGGAGAAACCTACACTTGGTAGTACA 
CAATCTGTGGAAGCTTATGTGTATGATGTTATTGAGAGAAATATCCCTCTTGTGAGGAAT 
ATAGTCATGGATATTTTAACTTCTCTTGCATTGGATGGAGTTAAGGTCAAGGGATTAGTT 
GTTGACTTTTTCTGTCTCCCTATGATTGACGTTGCTAAAGATATAAGTCTCCCTTTCTAT 
GTGTTCTTGACTACAAATTCCGGGTTCTTAGCTATGATGCAGTATCTAGCAGATCGACAT 
AGTAGAGATACATCGGTTTTTGTAAGAAACTCGGAAGAAATGTTGTCGATACCTGGATTT
GTAAACCCTGTCCCAGCCAATGTTCTGCCGTCAGCTCTGTTTGTTGAAGATGGTTATGAT
GCTTACGTTAAGCTGGCCATATTGTTTACAAAGGCCAATGGAATCCTAGTGAATAGCTCC
TTTGATATTGAGCCTTACTCTGTGAATCATTTTCTTCAAGAACAGAATTATCCTTCTGTT 
TATGCTGTTGGCCCCATATTTGACTTGAAAGCCCAGCCTCATCCAGAGCAGGACCTAACC
CGTCGTGACGAGTTGATGAAATGGCTTGATGATCAACCCGAGGCATCGGTTGTATTCCTT 
TGTTTTGGGAGTATGGCAAGGTTAAGAGGTTCTCTAGTGAAGGAAATAGCTCATGGACTT 
GAGCTATGTCAATATAGATTCCTCTGGTCACTCCGTAAAGAAGAGGTGACAAAGGATGAT
TTGCCAGAGGGGTTCCTTGACCGTGTCGATGGACGTGGAATGATATGTGGTTGGTCTCCT 
CAGGTAGAAATACTGGCCCATAAGGCAGTGGGAGGCTTTGTTTCTCACTGTGGATGGAAC 
TCAATAGTAGAGAGTTTGTGGTTTGGCGTGCCAATTGTGACATGGCCAATGTATGCAGAG 
CAACAACTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTAGCTGTGGAGCTGAAGCTT 
GATTACAGGGTACATAGTGATGAGATAGTAAACGCAAACGAGATAGAGACCGCTATTCGT 
TATGTAATGGACACGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG 
ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATATAT 
GACGTGATAGGAATTAAGCCCTAG



>UGT71D2

ATGAGGAATGCAGAGCTCATCTTCATCCCAACACCAACTGTTGGTCATCTTGTTCCGTTT 
CTTGAATTTGCTAGGCGTCTCATTGAGCAGGATGATAGAATCCGTATCACCTTCCTCTTG 
ATGAAGCAACAAGGTCAGTCTCATCTGGATTCCTATGTTAAGACAATTTCCTCGTCTCTG

CCGTTTGTTAGATTTATTGATGTCCCTGAGTTAGAGGAGAAACCAACACTTGGTACACAG 

TCTGTGGAAGCCTATGTGTACGATTTTATTGAAACAAATGTCCCTCTTGTGCAAAATATA 

ATCATGGGTATCCTATCTTCTCCTGCATTTGATGGAGTTACGGTCAAGGGATTCGTTGCT 

GATTTTTTCTGTCTCCCGATGATTGATGTTGCAAAAGATGCAAGTCTTCCTTTTTATGTG 

TTCTTGACTTCAAATTCCGGATTCCTAGCTATGATGCAGTATCTGGCATATGGACATAAG 

AAAGATACCTCAGTTTTTGCAAGAAACTCTGAAGAAATGTTGTCAATTCCTGGATTTGTA 

AACCCTGTCCCAGCCAAAGTACTGCCGTCAGCTCTGTTTATTGAGGATGGTTATGATGCT

GACGTTAAACTGGCTATATTGTTTACAAAGGCTAATGGAATCCTAGTGAATACCTCCTTT 

GATATTGAGCCTACCTCTCTGAATCATTTTCTTGGAGAAGAGAATTACCCTTCTGTTTAT 

GCTGTTGGCCCCATATTTAACCCGAAGGCCCATCCTCATCCAGATCAAGACCTCGCCTGT 

TGTGACGAGTCGATGAAATGGCTTGATGCTCAACCCGAGGCATCAGTTGTATTCCTTTGT 

TTTGGGAGTATGGGTAGCTTAAGAGGTCCTCTAGTGAAGGAAATAGCACATGGACTTGAG 

CTATGTCAGTATAGATTCCTCTGGTCACTCCGCACAGAAGAAGTGACAAATGATGATCTT 

TTGCCAGAGGGATTCATGGACCGTGTCAGTGGACGGGGAATGATATGCGGTTGGTCTCCT

CAGGTGGAAATACTGGCCCATAAAGCAGTGGGAGGTTTTGTTTCTCATTGTGGATGGAAC 

TCAATAGTAGAGAGTTTATGGTTTGGTGTGCCAATTGTGACATGGCCAATGTATGCAGAG 

CAACAGCTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTCGCAGTGGAGCTGAAACTC 

GATTATAGTGTACATAGTGGTGAGATTGTAAGTGCAAACGAGATAGAGACAGCGATTTCT 

TGTGTAATGAACAAGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG 

ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATACAT

GACGTGATAGGAACCAGGACTTAG



>UGT72B1

ATGGAGGAATCCAAAACACCTCACGTTGCGATCATACCAAGTCCGGGAATGGGTCATCTC 

ATACCACTCGTCGAGTTTGCTAAACGACTCGTCCATCTTCACGGCCTCACCGTTACCTTC 

GTCATCGCCGGCGAAGGTCCACCATCAAAAGCTCAGAGAACCGTCCTCGACTCTCTCCCT 

TCTTCAATCTCCTCCGTCTTTCTCCCTCCTGTTGATCTCACCGATCTCTCTTCGTCCACT

CGCATCGAATCTCGGATCTCCCTCACCGTGACTCGTTCAAACCCGGAGCTCCGGAAAGTC 

TTCGACTCGTTCGTGGAGGGAGGTCGTTTGCCAACGGCGCTCGTCGTCGATCTCTTCGGT 

ACGGACGCTTTCGACGTGGCCGTAGAATTTCACGTGCCACCGTATATTTTCTACCCAACA 

ACGGCCAACGTCTTGTCGTTTTTTCTCCATTTGCCTAAACTAGACGAAACGGTGTCGTGT 

GAGTTCAGGGAATTAACCGAACCGCTTATGCTTCCTGGATGTGTACCGGTTGCCGGGAAA

GATTTCCTTGACCCGGCCCAAGACCGGAAAGACGATGCATACAAATGGCTTCTCCATAAC

ACCAAGAGGTACAAAGAAGCCGAAGGTATTCTTGTGAATACCTTCTTTGAGCTAGAGCCA 

AATGCTATAAAGGCCTTGCAAGAACCGGGTCTTGATAAACCACCGGTTTATCCGGTTGGA 

CCGTTGGTTAACATTGGTAAGCAAGAGGCTAAGCAAACCGAAGAGTCTGAATGTTTAAAG 

TGGTTGGATAACCAGCCGCTCGGTTCGGTTTTATATGTGTCCTTTGGTAGTGGCGGTACC 

CTCACATGTGAGCAGCTCAATGAGCTTGCTCTTGGTCTTGCAGATAGTGAGCAACGGTTT 

CTTTGGGTCATACGAAGTCCTAGTGGGATCGCTAATTCGTCGTATTTTGATTCACATAGC 

CAAACAGATCCATTGACATTTTTACCACCGGGATTTTTAGAGCGGACTAAAAAAAGAGGT 

TTTGTGATCCCTTTTTGGGCTCCACAAGCCCAAGTCTTGGCGCATCCATCCACGGGAGGA 

tttttaactcattgtggatggaattcgactctagagagtgtagtaagcggtattccactt 
atagcatggccattatacgcagaacagaagatgaatgcggttttgttgagtgaagatatt 
cgtgcggcacttaggccgcgtgccggggacgatgggttagttagaagagaagaggtggct 
agagtggtaaaaggattgatggaaggtgaagaaggcaaaggagtgaggaacaagatgaag 

GAGTTGAAGGAAGCAGCTTGTAGGGTGTTGAAGGATGATGGGACTTCGACAAAAGCACTT 
AGTCTTGTGGCCTTAAAGTGGAAAGCCCACAAAAAAGAGTTAGAGCAAAATGGCAACCAC 

TAA



>UGT72B2

ATGCAAAAAATGGCAGATGGAAACACTCCACATGTAGCAATCATACCAAGTCCCGGTATA 
GGTCACCTCATCCCACTCGTCGAGTTAGCAAAGCGACTCCTTGACAATCACGGTTTCACC

GTCACTTTCATCATCCCCGGCGATTCTCCTCCGTCTAAGGCTCAAAGATCCGTTCTCAAC

TCTCTCCCTTCCTCCATAGCCTCCGTCTTCCTCCCTCCCGCCGATCTTTCCGACGTTCCT 

TCGACAGCTCGAATCGAAACTCGGATATCGCTCACCGTGACTCGTTCCAACCCGGCGCTC 

CGGGAGCTTTTTGGCTCGTTATCGGCGGAGAAACGTCTCCCGGCGGTTCTCGTCGTCGAT 

CTATTTGGTACGGATGCGTTCGACGTGGCTGCTGAGTTCCACGTGTCGCCATACATTTTC 

TATGCATCAAATGCCAACGTCCTCACGTTTCTGCTTCACTTGCCGAAGCTAGACGAAACG 

GTGTCGTGTGAGTTTAGGGAATTAACCGAACCGGTTATTATTCCCGGTTGTGTCCCCATA 

ACCGGTAAGGATTTCGTCGATCCGTGTCAAGACCGAAAAGATGAATCATACAAATGGCTT

CTACACAACGTCAAGAGATTCAAAGAAGCTGAAGGGATTCTAGTGAATTCCTTCGTCGAT 

TTAGAGCCAAACACTATAAAGATTGTACAAGAACCGGCTCCTGATAAACCACCGGTTTAC 

CTGATTGGGCCGTTGGTTAACTCGGGTTCACACGATGCTGACGTGAACGATGAGTACAAA 

TGTTTAAATTGGCTAGACAACCAACCATTCGGGTCGGTTCTATACGTATCCTTTGGAAGC 

GGCGGAACACTCACGTTTGAGCAGTTCATTGAGCTGGCTCTTGGCCTAGCGGAGAGTGGA

AAACGGTTTCTTTGGGTCATACGAAGTCCGAGTGGGATAGCTAGTTCATCGTATTTCAAT 

CCACAAAGCCGAAATGATCCATTTTCGTTTTTACCACAAGGCTTCTTAGACCGAACCAAA 

GAAAAAGGTCTAGTGGTTGGGTCATGGGCTCCACAGGCTCAAATTCTGACTCATACATCT 

ATAGGTGGATTTTTAACTCATTGTGGATGGAATTCGAGTCTAGAAAGTATTGTAAACGGT

GTACCGCTCATAGCATGGCCGTTATACGCGGAGCAAAAGATGAACGCATTGCTACTCGTG

GATGTTGGTGCGGCTCTAAGAGCACGACTGGGTGAAGACGGGGTCGTAGGAAGGGAAGAA 

GTGGCGAGAGTGGTAAAAGGATTGATAGAAGGAGAAGAAGGGAATGCGGTAAGGAAAAAA

ATGAAAGAGTTGAAAGAAGGATCTGTTAGAGTCTTAAGGGACGATGGATTCTCTACCAAA

TCGCTTAATGAAGTTTCGTTGAAGTGGAAAGCCCACCAACGAAAGATCGACCAAGAACAG 

GAATCATTTCTATGA

>UGT72B3

ATGAGCATAGATATTTTTCAAGAAATAAGAATAAAGAAAATTCTACTCTTAATGGCGGAA 

GCAAACACTCCACACATAGCAATCATGCCGAGTCCCGGTATGGGTCACCTTATCCCATTC 

GTCGAGTTAGCAAAGCGACTCGTTCAGCACGACTGTTTCACCGTCACAATGATCATCTCC 

GGTGAAACTTCGCCGTCTAAGGCACAAAGATCCGTTCTCAACTCTCTCCCTTCCTCCATA 

GCCTCCGTATTTCTCCCTCCCGCCGATCTTTCCGATGTTCCCTCCACAGCGCGAATCGAA 

ACTCGGGCCATGCTCACCATGACTCGTTCCAATCCGGCGCTGCGGGAGCTTTTTGGCTCT 

TTATCAACGAAGAAAAGTCTCCCGGCGGTTCTCGTCGTCGATATGTTTGGTGCGGATGCG 

TTCGACGTGGCCGTTGACTTCCACGTGTCACCATACATTTTCTATGCATCCAATGCAAAC 

GTCTTGTCGTTTTTTCTTCACTTGCCGAAACTAGACAAAACGGTGTCGTGTGAGTTTAGG 

TACTTAACCGAACCGCTTAAGATTCCCGGCTGTGTCCCGATAACCGGTAAGGACTTTCTT 

GATACGGTTCAAGACCGAAACGACGACGCATACAAATTGCTTCTCCATAACACCAAGAGG 

TACAAAGAAGCTAAAGGGATTCTAGTGAATTCCTTCGTTGATTTAGAGTCGAATGCAATA 

AAGGCCTTACAAGAACCGGCTCCTGATAAACCAACGGTATACCCGATTGGGCCGCTGGTT 

AAdVCAAGTTCATCTAATGTTAACTTGGAAGACAAGTTCGGATGTTTAAGTTGGCTAGAC 

AACCAACCATTCGGCTCGGTTCTATACATATCATTTGGAAGCGGCGGAACACTTACATGT 

GAGCAGTTTAATGAGCTTGCTATTGGTCTTGCGGAGAGCGGAAAACGGTTTATTTGGGTC 

ATACGAAGTCCAAGCGAGATAGTTAGTTCGTCGTATTTCAATCCACACAGCGAGACAGAC 

CCCTTTTCGTTTTTACCAATTGGGTTCTTAGACCGAACCAAAGAGAAAGGTTTGGTGGTT

CCATCATGGGCTCCACAGGTTCAAATCCTGGCTCATCCATCCACATGCGGGTTTTTAACA 

CACTGTGGATGGAATTCGACCTTAGA?yiGCATTGTAAACGGTGTACCACTCATAGCGTGG 

CCTTTATTCGCGGAGCAAAAGATGAATACATTGCTACTCGTGGAGGATGTTGGAGCGGCT 

CTAAGAATCCATGCGGGTGAAGATGGGATTGTACGGAGGGAAGAAGTGGTGAGAGTGGTG 

AAGGCACTGATGGAAGGTGAAGAGGGAAAAGCCATAGGAAATAAAGTGAAGGAGTTGAAA 

GAAGGAGTTGTTAGAGTCTTGGGTGACGATGGATTGTCCAGCAAGTCATTTGGTGAAGTT 

TTGTTAAAGTGGAAAACGCACCAGCGAGATATGAACCAAGAGACGTCCCACTAA
ATGGAACTTCACGGAGCTCTAGTGGCTAGTCCGGGCATGGGACATGCCGTACCCATCTTA

gaactcggtaaacatctcctgaaccaccacgggttcgaccgtgtcactgtcttcctagtc 

acagacgatgtctca^cgttcgaaatccctaattggaaaaacgttgatggaagaagatcca 

aaatttgtgatcaggtttattccactcgatgtttcgggtcaagatctgagtggttcacta 

ttgactaaactagcagagatgatgaggaaggcattaccagagatcaagtcttcagtcatg 

gagttagaaccgcggcctagggttttcgtagttgacttgttgggcacggaagctttagag

gtggctaaggagcttgggatcatgagaaaacatgttctggttactaccagtgcttggttt 

ctagcttttacggtttatatggcgagtcttgacaaacaggagttgtataagcagttgagt 

agcataggagcattgcttatacccggatgcagcccggttaagtttgagcgggctcaagat 

ccgagaaaatatattcgggaactcgctgagtctcagcgtattggggatgaggtgataacc 

gcagatggggtgtttgtgaatacgtggcacagtctggagcaagtgaccatcgggtctttc 

ttggatccagagaatctcggtcgggttatgagaggagtgccggtttatcctgttggaccg 

ctggttagaccagcagaaccaggtttgaaacatggcgtgctggactggcttgacttacaa 

cccaaagagtcagtggtttatgttcttttgggagtggtgggggcactaaccttcgagcag 

acaaacgagctggcttacggtttggagctgactggccacagatttgtttgggtagtcaga

ccy^ccggctgaagacgaccovtcggo^tcaatgttcgacaagaccy^gaatc 

cctctcgatttcttacccaacgggtttctagaccgaaccaaagacatcggtttggtggtc 

cgtacatgggcaccacaagaagagattctgggacacyy^gtcyulcaggagggtttgtgact 

a^ctgcggatggaactd^gttttggagagtattgtgaatggtotgccaatggtagct^ 

ccgttgta.ctgagagcagaagatgaacgcgaggatggtttc:tggggagctaaagattgcg 

ttgcagattaatgttgcagatgggattgtaaagaaggaggtgatagctgaaatggtgaag 

agagtgatggatgaagaagaaggaaaagagatgagaaagaatgttaaggaactgaagaag 

acagcagaagaagctctcaacatgactcacattccatctgcttacttcacctaa 



>UGT72D1

atggaccagcctcacgcgcttctagtggctagccctggcttgggtcacctcatccctatc 

ctggagctcggcaaccgtctctcctccgtcctaaacatccacgtcaccattctcgcggtc 
acctccggctcctcttcaccgacagaaaccgaagccatacatgcagccgcggctagaaca 
atctgtgaaattacggaaattccctcggtggatgtagacaacctcgtggagccagatgct 
acaattttcactaagatggtggtgaagatgcgagccatgaagcccgcggtacgagatgcc 
gtgaaattaatgaaacgaaaaccaacggtcatgattgttgactttttgggtacggaactg 
atgtccgtagccgatgacxstaggcatgacggctaaatacgtttacgttccaactcatgcg 
tggttcttggcagtcatggtgtacttgccggtgttagatacggtagtggaaggtgagtat 
gttgatattaaggagcctttgaagataccgggttgtaaaccggtcggaccgaaggagctg 
atggaaacgatgttagaccggtcgggccagcaatataaagagtgtgtacgagctggctta 
gaggtacctatgagcgatggtgttttggtaaatacttgggaggagttacaaggaaacact 
ctcgctgcgcttagagaggacgaagaattgagccgggtcatgaaagtaccggtttatcct 
attgggcgaattgttaggactaaccagcatgtagacaaacccaatagtatattcgagtgg 
ctagacgagcaacgggaaaggtcagtggtgtttgtgtgtttagggagcggtggaacgttg 
acgtttgagcaaacagtggaactcgctttgggtttagagttaagtggtcaaaggttcgtt 
tgggttctacgtaggcccgcttcatatctcggggcgatctccagcgatgatgaacaggta 
agtgccagtctacctgaaggtttcttggaccgcacgcgtggtgtggggattgtggttacg 
caatgggcaccacaagttgagatcttgagccatagatcgatcggtgggttcttgtctcac 
tgcggttggagttcggctttggaaagtttgactaaaggagttccgatcatcgcttggcct 
ctttatgcggagcagtggatgaatgccacgttattgactgaggagatcggtgtggccgtt 
cgtacatcggagttaccgtcggagagagtcatcggaagggaagaagtggcatctctggtg 
agaaagattatggcggaagaggatgaagaaggacagaaaattagggctaaagctgaggag 
gtgagggttagctccgaacgagcttggagtaaagacgggtcatcttataattctctattc 
gaatgggcaaaacgatgttatcttgtaccgtga 



>UGT72E1 
ATGAAGATTACAAAACCACATGTGGCCATGTTCGCTAGCCCCGGAATGGGCCACATCATC

CCGGTGATCGAGCTCGGAAAACGCTTAGCTGGTTCCCACGGCTTCGATGTCACCATTTTC

GTCCTTGAAACCGACGCAGCCTCAGCTCAATCTCAATTCCTTAACTCACCAGGCTGCGAC 

GCGGCCCTTGTTGATATCGTTGGCCTCCCAACGCCCGATATCTCCGGTTTAGTCGACCCA 

TCAGCCTTTTTTGGGATCAAGCTCTTGGTCATGATGCGTGAGACCATTCCTACCATCCGG

TCAAAGATAGAGGAGATGCAACACAAACCAACGGCTCTGATCGTAGACTTGTTTGGTTTG 

GACGCGATACCGCTCGGTGGTGAGTTCAACATGTTGACTTATATCTTCATCGCTTCAAAC 

GCACGTTTTCTCGCGGTGGCTTTGTTTTTCCCAACGTTGGACAAAGACATGGAAGAAGAG 

CACATAATCAAGAAGCAACCTATGGTTATGCCTGGATGTGAACCGGTTCGGTTTGAAGAT 

ACACTTGAAACATTCCTTGACCCAAACAGCCAACTCTACCGGGAATTTGTTCCTTTCGGT 

TCGGTTTTCCCAACGTGTGATGGTATTATTGTGAATACATGGGATGATATGGAGCCCAAA

ACTTTGAAATCTCTTCAAGACCCAAAGCTCTTGGGTCGAATTGCTGGTGTACCGGTTTAT 

CCAATTGGTCCTTTGTCTAGACCGGTTGATCCATCTAAAACTAATCATCCGGTTTTGGAT 

TGGTTAAACAAACAGCCGGACGAGTCGGTACTTTACATTTCATTTGGAAGCGGTGGCTCT

CTCTCGGCTAAACAACTAACCGAATTGGCTTGGGGACTTGAGATGAGTCAGCAACGGTTC 

GTTTGGGTGGTTCGACCCCCGGTGGACGGTTCAGCTTGCAGTGCATATTTATCCGCTAAC 

AGTGGTAAAATACGAGACGGTACACCTGATTATCTCCCGGAAGGTTTTGTTAGCCGGACT 

CATGAGAGAGGCTTTATGGTCTCTTCTTGGGCTCCCCAAGCGGAGATCTTGGCCCACCAA 

GCCGTAGGTGGGTTTCTAACTCACTGCGGTTGGAATTCGATTCTCGAGAGCGTCGTTGGT 

GGCGTTCCGATGATCGCGTGGCCACTTTTTGCGGAGCAGATGATGAACGCGACACTCCTC

AACGAAGAGCTTGGCGTTGCCGTCCGCTCTAAGAAACTACCGTCGGAGGGAGTGATTACG 

AGGGCGGAGATCGAGGCGTTGGTGAGAAAGATCATGGTGGAGGAGGAAGGTGCTGAGATG 

AGAAAGAAGATAAAGAAGCTGAAAGAGACCGCTGCCGAATCGCTGAGTTGCGACGGTGGA 

GTGGCGCATGAATCGTTGTCAAGAATCGGCGACGAGAGCGAGCATCTTTTGGAGCGTGTC

AGGTGCATGGCACGTGGTGCCTAG



>UGT72E2

ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCCATGTCATC

CCGGTGATCGAGCTTGGAAAGCGTCTCTCCGCTAACAACGGCTTCCACGTCACCGTCTTC 

GTCCTCGAAACCGACGCAGCCTCCGCTCAATCCAAGTTCCTAAACTCAACCGGCGTCGAC 

ATCGTCAAACTTCCATCGCCGGACATTTATGGTTTAGTGGACCCCGACGACCATGTAGTG 

ACCAAGATCGGAGTCATTATGCGTGCAGCAGTTCCAGCCCTCCGATCCAAGATCGCTGCC 

ATGCATCAAAAGCCAACGGCTCTGATCGTTGACTTGTTTGGCACAGATGCGTTATGTCTC 

GCAAAGGAATTTAACATGTTGAGTTATGTGTTTATCCCTACCAACGCACGTTTTCTCGGA

GTTTCGATTTATTATCCAAATTTGGACAAAGATATCAAGGAAGAGCACACAGTGCAAAGA 

AACCCACTCGCTATACCGGGGTGTGAACCGGTTAGGTTCGAAGATACTCTGGATGCATAT 

CTGGTTCCCGACGAACCGGTGTACCGGGATTTTGTTCGTCATGGTCTGGCTTACCCAAAA 

GCCGATGGAATTTTGGTAAATACATGGGAAGAGATGGAGCCCAAATCATTGAAGTCCCTT 

CTAAACCCAAAGCTCTTGGGCCGGGTTGCTCGTGTACCGGTCTATCCAATCGGTCCCTTA 

TGCAGACCGATACAATCATCCGAAACCGATCACCCGGTTTTGGATTGGTTAAACGAACAA

CCGAACGAGTCGGTTCTCTATATCTCCTTCGGGAGTGGTGGTTGTCTATCGGCGAAACAG 

TTAACTGAATTGGCGTGGGGACTCGAGCAGAGCCAGCAACGGTTCGTATGGGTGGTTCGA 

CCACCGGTCGACGGTTCCTGTTGTAGCGAGTATGTCTCGGCTAACGGTGGTGGAACCGAA 

GACAACACGCCAGAGTATCTACCGGAAGGGTTCGTGAGTCGTACTAGTGATAGAGGTTTC 

GTGGTCCCCTCATGGGCCCCACAAGCTGAAATCCTGTCCCATCGGGCCGTTGGTGGGTTT 

TTGACCCATTGCGGTTGGAGCTCGACGTTGGAAAGCGTCGTTGGCGGCGTTCCGATGATC 

GCATGGCCACTTTTTGCCGAGCAGAATATGAATGCGGCGTTGCTCAGCGACGAACTGGGA 

ATCGCAGTCAGATTGGATGATCCAAAGGAGGATATTTCTAGGTGGAAGATTGAGGCGTTG 

GTGAGGAAGGTTATGACTGAGAAGGAAGGTGAAGCGATGAGAAGGAAAGTGAAGAAGTTG 

AGAGACTCGGCGGAGATGTCACTGAGCATTGACGGTGGTGGTTTGGCGCACGAGTCGCTT

TGCAGAGTCACCAAGGAGTGTCAACGGTTTTTGGAACGTGTCGTGGACTTGTCACGTGGT 

GCTTAG
ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCC^

CCGGTGATCGAGCTAGCTAAGCGTCTCTCCGCTAACCACGGCTTCCACGTCACCGTCTTC 

GTCCTTGAAACTGACGCAGCCTCCGTTCAGTCOUIGCTCCTTAACTCAACCGGTGTTGAC 

ATCGTCAACCTTCCATCGCCCGACATTTCTGGCTTGGTAGACCCCAACGCCCATGTGGTG 

ACCAAGATCGGAGTCATTATGCGTGAAGCTGTTCCAACCCTCCGATCCAAGATCGTTGCC 

ATGCATCAAAACCCAACGGCTCTGATCATTGACTTGTTTGGCACAGATGCGTTATGTCTT 

GCAGCGGAGTTAAACATGTTGACTTATGTCTTTATCGCTTCCAACGCGCGTTATCTCGGA 

GTTTCGATATATTATCCAACTTTGGACGAAGTTATCAAAGAAGAGCACACAGTGCAACGA 

AAACCGCTCACTATACCGGGGTGTGAACCGGTTAGATTTGAAGATATTATGGATGCATAT 

CTGGTTCCiGGACGAACCGGTGTACCACGATTTGGTTCGTCACTGTCTGGCCTACCCAAAA 

GCGGATGGAATCTTGGTGAATACATGGGAAGAGATGGAGCCCAAATCATTAAAGTCCCTT 

CAAGACCCGAAACTTTTGGGCCGGGTCGCTCGTGTACCGGTTTATCCGGTTGGTCCGTTA 

TGCAGACCGATACAATCATCCACGACCGATCACCCGGTTTTTGATTGGTTAAACAAACAA 

CCAAACGAGTCGGTTCTCTACATTTCCTTCGGGAGTGGTGGTTCTCTAACGGCTCAACAG 

TTAACCGAATTGGCGTGGGGGCTCGAGGAGAGCCAGCAACGGTTTATATGGGTGGTTCGA 

CCGCCCGTTGACGGCTCGTCTTGCAGTGATTATTTCTCGGCTAAAGGCGGTGTAACCAAA 

GACAACACGCCAGAGTATCTACCAGAAGGGTTCGTGACTCGTACTTGCGATAGAGGTTTC 

ATGATCCCATCATGGGCACCGCAAGCTGAAATCCTAGCCCATCAGGCCGTTGGTGGGTTT 

TTAACACATTGTGGTTGGAGCTCGACGTTGGAAAGCGTCCTTTGCGGCGTTCCAATGATA 

GCGTGGCCGCTTTTCGCCGAGCAGAATATGAACGCGGCGTTGCTTAGCGATGAACTGGGA 

ATCTCTGTTAGAGTGGATGATCCAAAGGAGGCGATTTCTAGGTCGAAGATTGAGGCGATG 

GTGAGGAAGGTTATGGCTGAGGACGAAGGTGAAGAGATGAGAAGGAAAGTGAAGAAGTTG 

AGAGACACGGCGGAGATGTCACTTAGTATTCACGGTGGTGGTTCGGCGCATGAGTCGCTT 

TGCAGAGTCACGAAGGAGTGTCAACGGTTTTTGGAATGTGTCGGGGACTTGGGACGTGGT 

GCTTAG 



>UGT73B1 . 

ATGGGAACTCCTGTCGAAGTCTCTAAGCTCCATTTCTTGCTCTTCCCTTTCATGGCTCAT

GGCCATATGATACCAACTCTAGACATGGCTAAGCTCTTTGCCACCAAAGGAGCTAAATCC 

ACTATCCTCACTACACCTCTO^TGCCyU^GCTCTTCTTCGAGAAACCCATC^^ 

AACCAAGACAACCCGGGACTCGAAGACATCACCATCCAGATCCTTAATTTCCCTTGCACA 

GAGCTTGGTTTGCCTGATGGCTGTGAGAATACTGATTTCATCTTCTCCACACCTGACCTA 

AACGTAGGTGACTTGAGTCAAAAGTTTTTACTCGCAATGAAATATTTCGAAGAGCCACTA 

GAGGAGGTCCTCGTGACAATGAGACCAGACTGTCTTGTCGGTAACATGTTCTTCCCTTGG 

TCCACTAAAGTTGCTGAGAAGTTCGGAGTACCGAGACTTGTGTTCCACGGCACAGGCTAC 

TTCTCTTTATGTGCTTCTCATTGCATAAGGCTCCCTAAGAATGTGGCAACAAGTTCTGAG 

CCCTTTGTGATTCCTGATCTCCCGGGAGACATTTTGATTACAGAGGAACAGGTCATGGAG 

ACAGAAGAAGAGTCTGTAATGGGGAGGTTTATGAAGGCAATAAGAGACTCAGAGAGAGAT

AGCTTTGGCGTGTTGGTGAACAGCTTCTACGAGCTTGAACAGGCTTACTCAGATTATTTC 

AAGAGCTTTGTGGCGAAAAGAGCGTGGCATATCGGTCCGCTTTCCTTAGGAAATAGAAAG 

TTCGAGGAGAAAGCAGAAAGAGGCAAAAAGGCAAGCATTGATGAGCATGAATGTTTGAAA

TGGCTCGACTCCAAGAAATGTGATTCAGTGATTTACATGGCCTTTGGAACCATGTCTAGC 

TTTAAAAACGAGCAGCTGATAGAGATTGCAGCTGGTTTAGATATGTCAGGACATGATTTT 

GTCTGGGTGGTTAACAGAAAAGGCAGCCAAGTTGAGAAGGAAGATTGGTTACCAGAGGGG 

TTTGAAGAGAAGACCAAGGGAAAAGGATTGATAATCCGAGGGTGGGCGCCACAAGTGCTG 

ATACTTGAGCACAAAGCAATTGGCGGATTTTTGACGCATTGTGGATGGAACTCGTTATTA 

GAAGGGGTGGCAGCGGGCCTGCCAATGGTGACATGGCCCGTGGGAGCCGAGCAGTTCTAC 

AACGAGAAATTGGTGACACAAGTGTTGAAAACAGGAGTGAGTGTGGGAGTGAAGAAGATG 

ATGCAAGTAGTTGGAGACTTCATTAGCAGAGAGAAAGTGGAGGGAGCGGTGAGGGAAGTG 

ATGGTTGGAGAAGAGAGGAGGAAACGGGCCAAGGAGTTAGCAGAAATGGCGAAAAATGCG 






GTGAAAGAAGGAGGATCTTCAGATCTAGAGGTAGATAGGTTGATGGAAGAGCTTACGTTA 
GTTAAACTGCAAAAAGAGAAGGTATAA 



>UGT73B2 

ATGGGTAGTGATCATCATCATCGAAAGCTCCACGTTATGTTCTTCCCTTTCATGGCTTAT 

GGTCACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCC 

ACAATCCTCy^CCACATCTCTCAACTCayVGATCCTCCAAAAACCaVTCGACAC^^ 

AATCTGAATCCGGGTCTCGAAATCGACATCCAGATCTTCAR.TTTCCCTTGCGTGGAGCTG 

GGGTTACCAGAAGGATGTGAAAACGTTGATTTCTTCACTTCAAACAACAATGATGATAAA 

AACGAGATGATCGTGAAATTCTTTTTCTCGACAAGGTTTTTCAAAGACCAGCTTGAGAAA 

CTCCTCGGGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACT

GAAGCTGCTGGGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGCACTGGCTACTTCTCT 

TTATGCGCTGGTTATTGCATCGGAGTGCATAAACCACAGAAGAGAGTGGCTTCAAGCTCT 

GAGCCATTTGTGATTCCCGAGCTCCCTGGGAACATTGTGATAACTGAAGAACAGATCATA 

GATGGCGATGGAGAATCCGACATGGGAAAGTTTATGACTGAAGTTAGGGAATCGGAAGTG 

AAGAGCTCAGGAGTTGTTTTGAATAGTTTCTACGAGCTAGAACATGATTACGCCGATTTT

TACAAAAGTTGTGTACa^AAAGAGAGCGTGGCATATCGGTCCGCTATCGGTT^ 

GGATTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCGAACATTGATGAGGCTGAATGCCTC 

AAATGGCTTGACTCCAAGAAACCAAATTCAGTCATTTATGTTTCCTTTGGGAGCGTGGCT 

TTCTTCAAGAATGAACAGTTATTCGAGATCGCTGCAGGGTTAGAAGCTTCCGGTACAAGT

TTCATTTGGGTTGTTAGGAAAACCAAAGTGATAGAGAAGAATGGTTACCAGAAGGGTTCG 

AAGAGAGGGTGAAAGGGAAAGGTATGATAATAAGAGGATGGGCACCACAGGTGCTGATAC 

TTGACCACCAAGCAACCGGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTTGAAG 

GAGTGGCTGCAGGGCTACC7VATGGTGAa^TGGCCTGTAGGAGCGGAGC»ATTCTAGAATG 

AGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAGCAAGCATATGA

AAGTTATGATGGGAGATTTCATTAGCAGAGAGAAAGTGGATAAAGCGGTGAGGGAGGTTT

TGGCTGGGGAAGCAGCAGAGGAGAGGCGGAGACGGGCAAAGAAGCTAGCGGCGATGGCTA 

AAGCTGCCGTGGAAGAAGGAGGGTCTTCCTTCAACGATCTAAACAGCTTCATGGAAGAGT 

TTAGTTCATAA 



>UGT73B3

ATGAGTAGTGATCCTCATCGTAAGCTCCATGTTGTGTTCTTCCCTTTCATGGCTTATGGT 
CACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCTACA 
ATCCTCACCACACCTCTCAACTCCAAGATCTTCCAAAAACCCATCGAAAGATTCAAGAAC 
CTGAATCCGAGTTTCGAAATCGACATCCAGATCTTCGATTTCCCTTGCGTGGATCTCGGG 
TTACCAGAAGGATGCGAAAACGTCGATTTCTTCACCTCAAACAACAATGATGATAGACAG 
TATCTGACCTTGAAGTTCTTTAAGTCGAOyiGGTTTTTGAAAGATCAGCTTGAGAAGCTC 
GTCGAGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACGGAA 
GCTGCTGAGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGTACTGGCTACTTTTCTTTA 
TGCTCTGAATATTGCATCAGAGTGaiTAACCCAa^AAACyVTAGTAGCTT.CAAGGTACGAG 
CCATTTGTGATTCCTGATCTCCCGGGGAACATAGTGATAACTCAAGAACAGATAGCAGAC 
CGTGACGAAGAAAGCGAGATGGGGAAGTTTATGATTGAGGTCAAAGAATCTGATGTGAAG 
AGCTCAGGTGTTATTGTAAACAGCTTCTACGAGCTTGAACCTGATTACGCCGACTTTTAC 
AAGAGTGTTGTACTGAAGAGAGCGTGGCATATCGGTCCGCTTTCGGTTTACAACAGAGGA 
TTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCAAGCATTAATGAGGTTGAATGCCTCAAA 
TGGCTTGACTCCAAGAAACCAGATTCAGTCATTTACATTTCTTTTGGGAGCGTGGCTTGC 
TTCAAGAACGAGCAGCTATTCGAGATCGCTGCAGGATTAGAAACTTCTGGAGCAAATTTC 
ATCTGGGTTGTTAGGAAAAACATAGGTATTGAAAAAGAAGAATGGTTACCAGAAGGGTTC 
GAAGAGAGGGTGAAAGGAAAAGGGATGATTATAAGAGGATGGGCACCACAGGTGCTCATA 
CTTGATCATCAAGCAACTTGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTGGAA 
GGAGTGGCTGCAGGGCTACCAATGGTGACATGGCCTGTAGCAGCGGAGCAATTCTACAAT 
GAGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAAAAAGAATGTA 
AGAACTACGGGAGATTTCATTAGCAGAGAGAAAGTGGTTAAAGCGGTGAGGGAGGTGTTG
GTTGGGGAAGAGGCGGATGAGAGGCGGGAGAGGGCAAAGAAGTTGGCAGAGATGGCTAAA
GCTGCCGTGGAAGGAGGGTCTTCTTTCAACGATCTAAACAGCTTCATAGAAGAGTTTACC 
TCGTAA



>UGT73B4 

ATGAACAGAGAGCAAATTCATATTTTGTTCTTCCCCTTCATGGCTCATGGCCACATGATT 

CCACTCTTAGACATGGCCaAGCTTTTCGCTAGAAGAGGAGCCaAATC^ 

ACCCCy^TiuyiTGCTAAGATCTTGGAGAAACCCATTGAAGCATTCAAAGTTCAJ^ 

GATCTCGAAATCGGAATCAAGATCCTCAATTTCCCTTGTGTAGAGCTTGGATTGCCAGAA 

GGATGCGAGAACCGTGACTTCATTAACTCATACCaAAAATCTGACTCAT 

TTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAGCAGTTGGAGAGTTTCATTGAAACA 

ACCAAACCGAGTGCTCTTGTAGCCGATATGTTCTTCCCTTGGGCAACAGAATCCGCGGAG

AAGATCGGTGTTCCyy^GACTTGTGTTCCACGGCACATCATCCTTTGCCT^^^ 

TATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTTCGAGTTCTACTCCATTTGTA 

ATCCCTGGTCTCCCTGGAGACATAGTTATTACAGAAGACGAAGCCAATGTCACCAACGAA 

GAAACTCCATTCGGAAAGTTTTGGA?^GAAGTCAGGGAATCAGAGACCAGTAGCTTTGGT 

GTTTTGGTGAATAGCTTCTACGAGCTGGAATCATCTTATGCTGATTTTTACCGTAGTTTT 

GTGGCGAAAAAAGCGTGGCATATAGGTCCACTTTCACTATCO^Cy^GAGGGATTGC^ 

AAAGCCGGAAGAGGGAAAAAGGCAAACATTGATGAGCAAGAATGCCTCAAATGGCTTGAC 

TCTAAGACACCTGGCTCAGTAGTTTACTTGTCCTTTGGTAGCGGAACCGGCTTACCCAAC 

GAACAGCTGTTAGAGATTGCTTTCGGCCTTGAAGGCTCTGGACAAAATTTCATTTGGGTG 

GTTAGCAAAAATGAAAACC?^GGTGAAAATGAAGATTGGTTGCCTi^ 

AGGAATAAAGGAAAAGGGCTGATAATACGCGGATGGGCCCCGC^ 

CACAAAGCAATCGGAGGATTTGTGACGCATTGCGGATGGAACTCGACTTTGGAGGGCATT 

GCCGCAGGGCTGCCTATGGTGACTTGGCCGATGGGGGCAGAACAGTTCTACAACGAGA^ 

TTATTGAd\AAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTGGTGAAAAAA 

GGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATTGGTGGTGAG 

AAGGCAGAGGAAAGGCGGCTAAGGGCTAAGGAGCTGGGCGAGATGGCTAAAGCCGCTGTG

GAAGAAGGAGGGTCTTCTTATAATGATGTGAAO^GTTTATGGAAGAGCT^ 

AAGTAG 



>UGT73B5

ATGAACAGAGAAGTCTCTGAGAGAATTCATATTTTGTTCTTCCCCTTCATGGCTCAAGGC 

CACATGATTCCAATTTTGGACATGGCCAAGCTTTTCTCGAGGAGAGGAGCCAAGTCAACC 

CTTCTCACAACCCCAATCAACGCTAAGATCTTCGAGAAACCTATTGAAGCATTCAAAA^ 

Ca^AAACCCTGATCTCGAAATCGGAATCAAGATCTTO^TTTCCCTTGT 

TTGCCTGAAGGATGCGAGAACGCTGACTTTATCAACTCATACCAAAAATCTGACTCAGGT 

GACTTGTTCTTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAACAGTTGGAGAGTTTC 

ATTGAAACAACCAAACay^GTGCTCTTGTTGCCGATATGTTCTTCCCTTGGGCGACAGA^ 

TCTGCTGAGAAGCTCGGTGTACCAAGACTTGTGTTCCACGGTACATCTTTCTTTTCTTTG 

rGTTGTTCGTATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTACGAGTTCTACT 

CCTTTTGTi^TCCCTGGTCTCCCAGGAGACATAGTTATTACAGAAGACCAAGCCS^TGTT 

GCCAAAGAAGAAACGCCAATGGGAAAGTTTATGAAAGAGGTTAGGGAATCAGAGACCAAT 

AGCTTTGGTGTATTGGTTAATAGCTTCTACGAGCTGGAATCAGCTTATGCTGATTTTTAT 

CGTAGTTTTGTGGCGAAAAGAGCTTGGCATATCGGTCCGCTTTCGCTATCTAACAGAGAG 

TTAGGAGAGAAAGCCAGAAGAGGGAAAAAGGCTAACATTGATGAGCAAGAATGCCTAAAA 

TGGCTGGACTCTAAGACACCTGGTTCAGTAGTTTACTTGTCCTTTGGGAGCGGAACTAAT 

TTCACCAACGACCAGCTGTTAGAGATCGCTTTTGGTCTTGAAGGTTCTGGACAAAGTTTC 

ATCTGGGTGGTTAGGAAAAATGAAAACCAAGGTGACAATGAAGAGTGGTTGCCTGAAGGG 

TTTAAAGAGAGGACAACAGGGAAAGGGCTAATAATACCTGGATGGGCGCCGCAAGTGCTG 

ATACTTGACCATAAAGCAATTGGAGGATTTGTGACTCATTGCGGATGGAACTCGGCTATA 
GAGGGCATTGCCGCGGGGCTGCCTATGGTAACATGGCCAATGGGGGCAGAACAGT^

AATGAGAAGCTATTGACAAAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTG

GTGAAAAAAGGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATT

GGTGGTGAGAAGGCAGAGGAAAGGCGGCTATGGGCTAAGAAGCTGGGCGAGATGGCTAAA 

GCCGCTGTGGAAGAAGGAGGGTCCTCTTATAATGATGTGAACAAGTTTATGGAAGAGCTG 

AATGGTAGAAAGTAG 



>UGT73C1 

ATGGCATCGGAATTTCGTCCTCCTCTTCATTTTGTTCTCTTCCCTTTCATGGCTCAAGGC 

CACATGATCCCAATGGTAGATATTGCAAGGCTCCTGGCTCAGCGCGGGGTGACTATAACC

ATTGTCACTACy^CCTCAAAACGCAGGCCGGTTCAAGAACGTTCTTAGCCG^ 

TCCGGCTTGCCCATCAATCTCGTGCAAGTAAAGTTTCCATCTCAAGAATCGGGTTCACCG

GAAGGACAGGAGAATTTGGACTTGCTCGATTCATTGGGGGCTTCATTAACCTTCTTCAAA 

GCATTTAGCCTGCTCGAGGAACCAGTCGAGAAGCTCTTGAAAGAGATTCAACCTAGGCCA 

AACTGCATAATCGCTGACATGTGTTTGCCTTATACAAACAGAATTGCCAAGAATCTTGGT 

ATACCAAAAATCATCTTTCATGGCATGTGTTGCTTCAATCTTCTTTGTACGCACATAATG 

CACCAAAACCACGAGTTCTTGGAAACTATAGAGTCTGACAAGGAATACTTCCCCATTCCT 

AATTTCCCTGACAGAGTTGAGTTCACAAAATCTCAGCTTCCAATGGTATTAGTTGCTGGA 

GATTGGAi^GACTTCCTTGACGGAATGACAGAAGGGGATAACACTTCTT^ 

GTTAACACGTTTGAAGAGCTCGAGCCAGCTTATGTTAGAGACTAGAAGAAGGTTAAAGCG 

GGTAAGATATGGAGCATCGGACCGGTTTCCTTGTGCAACAAGTTAGGAGAAGACCAAGCT 

GAGAGGGGAAACAAGGCGGACATTGATCAAGACGAGTGTATTAAATGGCTTGATTCTAAA 

GAAGAAGGGTCGGTGCTATATGTTTGCCTTGGAAGTATATGCAATCTTCCTCTGTCTCAG

CTCAAAGAGCTCGGCTTAGGCCTCGAGGAATCCCAAAGACCTTTCATTTGGGTCATAAGA 

GGTTGGGAGAAGTATAACGAGTTACTTGAATGGATCTCAGAGAGCGGTTATAAGGAAAGA 

ATCAAAGAAAGAGGCCTTCTCATAACAGGATGGTCGCCTCAAATGCTTATCCTTACACA 

CCTGCCGTTGGAGGATTCTTGACACATTGTGGATGGAACTCTACTCTTGMGGAATC^^ 

TCAGGCGTTCCATTACTCT^CGTGGCCACTGTTTGGAGACCAATTCTGCAATGAGAAATTG 

GCGGTGCAGATACTAAAAGCCGGTGTGAGAGCTGGGGTTGAAGAGTCCATGAGATGGGGA 

GAAGAGGAGAAAATAGGAGTACTGGTGGATAAAGAAGGAGTAAAGAAGGCAGTGGAGGAA 

TTGATGGGTGATAGTAATGATGCTAAGGAGAGAAGAAAAAGAGTGAAAGAGCTTGGAGAA 

TTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCCAACATCACATTCTTGCTA 

CAAGACATAATGCAATTAGAACAACCCAAGAAATGA 



>UGT73C2 

ATGGCTTTCGAGAAGACCCGGCAATTTCTTCCTCGGCTTCACTTTGTTCTCTTCCCTTTC 

ATGGCTCAAGGCCACATGATCCCCATGGTGGATATTGCAAGGATCTTGGCTCAGCGCGGG 

GTGACTATTACCATTGTCACGACGCCTCACAACGCAGCCAGGTTCAAAGATGTCCTAAAC 

CGGGCCATCCAGTCAGGCTTGCACATTAGGGTTGAGCATGTGAAGTTTCCTTTTCAAGAA 

GCTGGTTTGCAAGAAGGACAAGAGAATGTTGATTTTCTTGACTCAATGGAGTTAATGGTA

CATTTCTTTAAAGCGGTTAACATGCTTGAAAATCCGGTCATGAAGCTCATGGAAGAGATG 

AAACCTAAACCAAGCTGCCTAATTTCTGATTTTTGTTTGCCTTATACL?yiGC:AAA^ 

AAGAGGTTCAATATCCCAAAGATCGTTTTCCATGGCGTGTCTTGCTTTTGTCTTTTGAGT 

ATGCATATTCTACACCGAAACCACAATATCTTACATGCTTTAAAGTCGGACAAAGAGTAT 

TTCTTGGTTCCTAGTTTTCCAGATAGAGTTGAATTTACAAAGCTTCAAGTTACTGTGAAA 

ACAAACTTTAGTGGAGATTGGAAAGAGATCATGGACGAACAGGTGGATGCTGATGACACG 

TCCTATGGTGTAATTGTCAACACATTTCAGGATTTGGAGTCTGCCTATGTGAAAAACTAC 

ACGGAGGCTAGGGCTGGTAAAGTATGGAGCATCGGTCCGGTTTCCTTGTGCAACAAGGTA 

GGAGAAGACAAAGCTGAGAGGGGAAACAAGGCAGCCATTGATCAAGACGAGTGTATTAAA 

TGGCTTGATTCTAAAGATGTAGAGTCGGTGCTGTATGTTTGCCTTGGAAGTATATGCAAT 

CTTCCTCTGGCTCAGCTTAGAGAGCTCGGGCTAGGCCTCGAGGCAACTAAAAGACCATTC 

ATTTGGGTCATAAGAGGTGGGGGAAAGTATCATGAACTAGCTGAGTGGATCTTAGAGAGC 
GGTTTTGAAGAAAGAACCAAAGAGAGAAGCCTTCTCATAAAAGGATGGTCGCCTCAAATG
CTTATCCTTTCyVCACCCTGCCGTTGGAGGATTCCTGACACATTGTGGATGGAACTCAACT 
TTAGAAGGAATCACCTCAGGGGTTCCATTGATCACTTGGCCATTATTTGGAGACCAATTC 
TGCAACCAGAAACTGATCGTGCAGGTGCTAAAAGCAGGTGTAAGTGTTGGGGTTGAAGAG 
GTCATGAAATGGGGAGAAGAGGAGAGTATTGGAGTGTTAGTGGATAAAGAAGGAGTGAAG 
AAGGCAGTGGACGAAATAATGGGCGAGAGTGATGAAGCAAAAGAGAGAAGAAAAAGAGTC 
AGAGAGCTTGGAGAATTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAT 
ATCATATTTTTGCTACAAGATATAATGCAACAAGTAGAATCCAAGAGTTGA



>UGT73C3

ATGGCTACGGAAAAAACCCACCAATTTCATCCTTCTCTTCACTTTGTCCTCTTCCCTTTC 

ATGGCTCAAGGCCACATGATTCCCATGATTGATATTGCAAGACTCTTGGCTCAGCGTGGT 

GTGACCATAACAATTGTCACGACACCTCACAACGCAGCAAGGTTTAAGAATGTCCTAAAC 

CGAGCGATCGAGTCTGGCTTGGCCATCAACATACTGCATGTGAAGTTTCCATATCAAGAG 

TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTTAGACTCAACGGAGTTGATGGTA 

CCTTTGTTCAAAGCGGTGAACTTGCTTGAAGATCCGGTCATGAAGCTCATGGAAGAGATG 

AAACCTAGACCTAGCTGTCTAATTTCTGATTGGTGTTTGCCTTATACAAGCATAATCGCC 

AAGAACrra^TATACCS^iSATAGTTTTCCACGGCATGGGTTGCTTTAATCTTTT^^^ 

ATGCATGTTCTACGCAGAAACTTAGAGATCCTAGAGAATGTAAAGTCGGATGAAGAGTAT 

TTCTTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCTTCAACTTCCTGTGAAA 

GO^TGCTAGTGGAGATTGGAAAGAGATAATGGATGAAATGGTAAAAGCAGAATAC^ 

TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCACCTTATGTCAAAGACTAC 

AAAGAGGCAATGGATGGAAAAGTATGGTCCATTGGACCCGTTTCCTTGTGTAACAAGGCA 

GGTGCAGACAAAGCTGAGAGGGGAAGCAAGGCCGCCATTGATCAAGATGAGTGTCTTCAA

TGGCTTGATTCTAAAGAAGAAGGTTCGGTGCTCTATGTTTGCCTTGGAAGTATATGTAAT 

CTTCCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAGGAATCTCGAAGATCTTTT

ATTTGGGTCATAAGAGGTTCGGAAAAGTATAAAGAACTATTTGAGTGGATGTTGGAGAGC 

GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTCATTAAAGGGTGGGCACCTCAAGTC 

CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT 

CTCGAAGGAATCACCTCa^GGaiTTCCACTGATCACTrGGCCGCTGTTTGGAGACCAATTC 

TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA 

GTCATGAAATGGGGAGAAGAAGATAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTGAAA 

AAGGCTGTGGAAGAATTGATGGGTGATAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC

AAAGAGCTTGGAGAATTAGCTCACAAAGCTGTGGAAAAAGGAGGCTCTTCTCATTCTAAC 

ATCACACTCTTGCTACAAGACATAATGCAACTAGCACAATTCAAGAATTGA 



>UGT73C4

ATGGCTTCCGAAAAATCCCACAAAGTTCATCCTCCTCTTCACTTTATTCTTTTCCCTTTC 
ATGGCTCAGGGCCACATGATTCCCATGATTGATATAGCAAGGCTCTTGGCTCAGCGCGGT 
GCGACAGTAACTATTGTOVCGACACGTTATAAfGCAGGGAGGTTCGAGAATGTCTTAAGT 
CGTGCCATGGAGTCTGGTTTACCCATCAACATAGTGCATGTGAATTTTCCATATCAAGAA 
TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTATGACTCAATGGAGCTGATGGTA 
CCTTTCTTTCAAGCAGTTAACATGCTCGAAGATCCGGTCATGAAGCTCATGGAAGAGATG 
AAACCTAGACCTAGCTGTATTATTTCTGATTTGCTCTTGCCTTATACAAGCAAAATCGCA 
AGGAAATTCAGTATACCAAAGATAGTTTTCCACGGCACGGGTTGCTTTAATCTTTTGTGT 
ATGCATGTTCTACGCAGAAACCTCGAGATCTTGAAGAACTTAAAGTCGGATAAAGATTAT 
TTCCTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCCTCAAGTTCCAGTGGAA 
ACAACTGCAAGTGGAGATTGGAAAGCGTTCTTGGACGAAATGGTAGAAGCAGAATACACA 
TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCTGCTTATGTCAAAGACTAC 
ACGAAGGCTAGGGCTGGAAAAGTATGGTCCATTGGACCTGTTTCCTTGTGCAACAAGGCA 
GGTGCTGATAAAGGTGAGAGGGGAAACCAGGCCGCCATTGATCAAGATGAGTGTCTTCAA 
TGGCTTGATTCTAAAGAAGATGGTTCGGTGTTATATGTTTGCCTTGGAAGTATCTGTAAT
CTACCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAAAAATCCCJ^GATCTTTT
ATTTGGGTCATAAGAGGTTGGGAAAAGTATAATGAACTATATGAGTGGATGATGGAGAGC 
GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTTATTAAAGGGTGGTCACCTCAAGTC 
CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT 
CTCGAAGGAATCACCTCAGGCATTCCACTGATCACTTGGCCGCTGTTTGGAGACCAATTC 
TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA 
GTCATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTAAAG
AAGGCAGTGGAAGAGTTAATGGGTGCGAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC 
AAAGAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAC 
ATCACATACTTGCTACAAGACATAATGCAACAAGTGAAATCCAAGAACTGA 



>UGT73C5 

ATGGTTTCCGAAACAACCAAATCTTCTCCACTTCACTTTGTTCTCTTCCCTTTCATGGCT 

CAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGTGGTGTGATC 

ATAACAATTGTCACGACGCCTCACAATGCAGCGAGGTTCAAGAATGTCCTAAACCGTGCC 

ATTGAGTCTGGCTTGCCCATCAACTTAGTGCAAGTCAAGTTTCCATATCTAGAAGCTGGT 

TTGCAAGAAGGACAAGAGAATATCGATTCTCTTGACACAATGGAGCGGATGATACCTTTC 

TTTAAAGCGGTTAACTTTCTCGAAGAACCAGTCCAGAAGCTCATTGAAGAGATGAACCCT 

CGACCAAGCTGTCTAATTTCTGATTTTTGTTTGCCTTATACy^GCAAAATCGCCAAGAAG 

TTCAATATCCCAAAGATCCTCTTCCATGGCATGGGTTGCTTTTGTCTTCTGTGTATGCAT 

GTTTTACGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCAGATAAGGAGCTTTTCACT 

GTTCCTGATTTTCCTGATAGAGTTGAATTCACAAGAACGCAAGTTCCGGTAGAAACATAT 

GTTCCAGCTGGAGACTGGAAAGATATCTTTGATGGTATGGTAGAAGCGAATGAGACATCT 

TATGGTGTGATCGTa^CTCATTTCy^GAGCTaSAGCCTGCTTATGCCAAAGACTACAAG 

GAGGTAAGGTCCGGTAAAGCATGGACCATTGGACGCGTTTCCTTGTGCAAGAAGGTAGGA 

GCCGACAAAGCAGAGAGGGGAAACAAATCAGACATTGATCAAGATGAGTGCCTTAAATGG 

CTCGATTCTAAGAAACATGGCTCGGTGCTTTACGTTTGTCTTGGAAGTATCTGTAATCTT 

CCTTTGTCTCAACTCAAGGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATT 

TGGGTCATAAGAGGTTGGGAGAAGTACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC 

TTTGAAGATAGAATCCAAGATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT 

ATCCTTTCACATCCATCAGTTGGAGGGTTCCTAACACACTGTGGTTGGAACTCGACTCTT 

GAGGGGATAACTGCTGGTCTACCGCTACTTACATGGCCGCTATTCGCAGACCAATTCTGC 

AATGAGAAATTGGTCGTTGAGGTACTAAAAGCCGGTGTAAGATCCGGGGTTGAACAGCCT 

ATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG 

GCAGTGGAAGAATTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA 

GAGCTTGGAGATTCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAACATC 

TCTTTCTTGCTACAAGACATAATGGAACTGGCAGAACCCAATAATTGA

>UGT73C6

ATGGCTTTCGAAAAAAACAACGAACCTTTTCCTCTTCACTTTGTTCTCTTCCCTTTCATG 

GCTCAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGAGGTGTG 

CTTATAACAATTGTCACGACGCCTCACAATGCAGCAAGGTTCAAGAATGTCCTAAACCGT 

GCCATTGAGTCTGGTTTGCCCATCAACCTAGTGCAAGTCAAGTTTCCATATCAAGAAGCT 

GGTCTGCAAGAAGGACAAGAAAATATGGATTTGCTTACCACGATGGAGCAGATAACATCT 

TTCTTTAAAGCGGTTAACTTACTCAAAGAACCAGTCCAGAACCTTATTGAAGAGATGAGC 

CCGCGACCAAGCTGTCTAATCTCTGATATGTGTTTGTCGTATACAAGCGAAATCGCCAAG 

AAGTTCAAAATACCAAAGATCCTCTTCCATGGCATGGGTTGCTTTTGTCTTCTGTGTGTT 

AACGTTCTGCGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCTGATAAGGAGTACTTC

ATTGTTCCTTATTTTCCTGATAGAGTTGAATTCACAAGACCTCAAGTTCCGGTGGAAACA 

TATGTTCCTGCAGGCTGGAAAGAGATCTTGGAGGATATGGTAGAAGCGGATAAGACATCT 

TATGGTGTTATAGTCAACTCATTTCAAGAGCTCGAACCTGCGTATGCCAAAGACTTCAAG 

GAGGCAAGGTCTGGTAAAGCATGGACCATTGGACCTGTTTCCTTGTGCAACAAGGTAGGA 
GTAGACAAAGCAGAGAGGGGAAACAAATCAGATATTGATCAAGATGAGTGCCTTGAATGG

CTCGATTCTAAGGAACCGGGATCTGTGCTCTACGTTTGCCTTGGAAGTATTTGTAATCTT 

CCTCTGTCTCAGCTCCTTGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATC 

TGGGTCATAAGAGGTTGGGAGAAATACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC 

TTTGAAGATAGAATCO^GATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT 

ATCCTTTCACATCCTTCTGTTGGAGGGTTCTTAACGCACTGCGGATGGAACTCGACTCTT 

GAGGGGATAACTGCTGGTCTACCAATGCTTACATGGCCACTATTTGCAGACCAA 

AACGAGAAACTGGTCGTACAAATACTAAAAGTCGGTGTAAGTGCCGAGGTTAAAGAGGTC 

ATGAAATGGGGAGAAGAAGAGAAGATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG 

GCAGTGGAAGAACTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA 

GAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCCTCTCATTCTAATATC

ACTTTCTTGCTACAAGACATAATGCAACTAGCACAGTCCAATAATTGA



>UGT73C7

ATGTGTTCTCATGATCCTCTTCACTTCGTCGTAATACCCTTTATGGCCCAAGGCCATATG 

ATCCCATTGGTCGACATCTCTAGGCTCTTGTCCCAGCGCCAAGGCGTGACTGTCTGCATC 

ATCACAACTACTCAAAATGTAGCCAAGATCAAGACTTCACTCTCATTTTCCTCTTTGTTT 

GCGACTATCAACATCGTTGAAGTTAAGTTTCTGTCTCAACAAACGGGTTTGCCAGAAGGG 

TGCGAGAGTTTAGATATGTTGGCTTCAATGGGCGATATGGTGAAGTTCTTTGATGCTGCC 

AACTCACTTGAGGAGCAiVGTTGAGAAAGCTATGGAAGAGATGGTTCAGCCGCGGCCAAGC 

TGCATCATTGGAGACATGAGCCTTCCTTTCACTTO^GACTTGCCAAGAAATTC^ . 

CCCAAACTTATCTTCCATGGGTTTTCTTGTTTCAGCCTCATGTCTATACAAGTGGTTCGA 

GAAAGCGGGATCTTGAAAATGATAGAATCAAACGACGAGTATTTTGATTTGCCCGGCTTG 

CCTGACAAAGTTGAGTTCACGAAACCTCAGGTCTCTGTGTTGCAACCTGTTGAAGGA^ • 

ATGAAAGAGAGTACGGCCAAGATTATTGAAGCTGATAATGACTCTTATGGTGTTATTGTG

AACACTTTTGAAGAGTTAGAGGTTGATTATGCAAGAGAATATAGGAAAGCAAGGGCTGGA 

AAAGTTTGGTGCGTTGGACCTGTTTCCTTGTGCAATAGGTTAGGGTTAGACAAAGCTAAA 

AGAGGAGATAAGGCTTCTATTGGTCAAGACCAATGTCTTCAATGGCTTGACTCTCAAGAA 

ACTGGTTCAGTGCTCTACGTTTGCCTTGGAAGTCTATGTAATCTTCCCTTGGCTCAGCTC 

A?^GAGCTGGGACTAGGCCTTGAGGCATCTAATAAACCTTTCATATGGGTTATAAGAGAA 

TGGGGAAAATATGGAGATTTAGCAAATTGGATGCAACAAAGCGGATTTGAAGAGCGGATC 

AAAGATAGAGGACTGGTGATCAAAGGTTGGGCGCCGCAAGTTTTCATCCTCTCACACGCA 

TCCATTGGAGGGTTTTTGACTCACTGTGGATGGAACTCGACACTAGAAGGAATTACTGCA 

GGAGTTCCATTATTGACATGGCCTTTGTTTGCTGAACAATTCTTGAATGAGAAGTTAGTT 

GTGCAGATACTAAAAGCAGGGTTAAAGATAGGAGTAGAGAAATTGATGAAATATGGAAAA 

GAAGAGGAGATAGGAGCGATGGTGAGCAGAGAATGTGTGAGA?y^GCTGTGGATGAGCTA 

ATGGGTGATAGTGAAGAAGCAGAAGAGAGAAGAAGAAAAGTTACAGAACTTAGTGACTTG 

GCAAATAAGGCTTTGGAAAAAGGAGGATCTTCAGATTCTAATATCACATTGCTCATTCAA 

GATATTATGGAGCAATCACA?^TCAATTTTAA



>UGT73D1 

ATGGAATCAAAAATAGTTTCAAAAGCCAAAAGACTTCACTTTGTTTTGATCCCTCTCATG 
GCTCAAGGGCATCTGATCCCCATGGTCGACATCTCCAAGATTCTTGCACGACAAGGCAAC
ATCGTTACCATAGTTACAACCCCTCAAAATGCTTCTAGGTTTGCGAAGACAGTTGACCGA 
GCAAGATTAGAGTCGGGTCTCGAAATCAATGTCGTTAAATTTCCAATTCCTTACAAAGAA
TTCGGTCTTCCCAAAGATTGTGAGACTCTGGACACTTTGCCCTCCAAAGACCTCCTACGA 
AGATTCTATGACGCTGTGGATAAACTCCAAGAGCCCATGGAACGGTTTCTTGAGCAACAA
GATATCCCTCCAAGTTGCATAATCTCCGATAAATGCCTTTTTTGGACGTCAAGAACCGCA 
AAGAGGTTCAAAATCCCGAGGATCGTGTTCCATGGAATGTGTTGCTTCTCTCTTTTGAGT 
TCGCACAATATCCATCTTCATAGCCCGCACCTCTCGGTTTCTTCGGCCGTAGAGCCATTC 
CCTATACCAGGAATGCCACATAGGATTGAGATAGCTAGAGCTCAGTTACCTGGTGCTTTT 
GAGAAGTTAGCAAATATGGATGACGTTCGCGAGAAGATGCGTGAATCTGAATCAGAAGCC 
TTTGGGGTTATTGTTAATAGCTTCCAGGAATTGGAGCCTGGCTATGCAGAGGCCTACGCT
GAGGCCATCAATAAGAAGGTATGGTTCGTTGGACCCGTTTCTTTATGCAACGACCGTATG 
GCTGACCTATTCGATAGAGGAAGTAATGGTAACATCGCAATAAGCGAGACCGAATGCTTG 
CAGTTTCTTGACTCGATGAGACCAAGGTCAGTCTTATATGTTTCTCTTGGTAGCCTCTGT 
CGACTAATACCTAATCAATTQATAGAACTAGGTTTAGGGTTAGAAGAATCGGGAAAACCC 
TTTATTTGGGTGATAAAGACCGAGGAAAAACACATGATTGAGCTAGACGAATGGCTAAAA 
CGCGAAAATTTTGAAGAGCGAGTTAGAGGAAGAGGGATAGTAATAAAGGGTTGGAGTCCT 
CAGGCTATGATACTCTCACATGGTTCAACCGGCGGGTTCTTGACTCATTGCGGTTGGAAT 
TCTACAATAGAAGCGATATGTTTTGGTGTACCAATGATCACATGGCCGTTGTTCGCTGAA 

caatttctcaatgagaaactcatcgtggaggttttgaacatcgggSttagggttggggtg 
gagattccggtgagatggggagacgaggagagacttggagtgttggtcaagaaaccgagt
gttgtgaaagctataaagcttttgatggaccaagattgtcaacgtgtagacgaaaatgat 

GATGATAATGAATTCGTGAGACGAAGGAGACGTATTCAAGAACTTGCAGTAATGGCGAAA 

aaggctgtggaagaaaagggatcttcgagtattaacgtttcaattttaatccaagatgtt 

TTGGAGCAATTGAGTCTCGTGTAG 



>UGT74B1

ATGGCGGAAACAACTCCCAAAGTGAAAGGCCACGTCGTAATCTTACCATACCCAGTTCAA 

GGCCACCTAAACCCAATGGTTCAATTCGCTAAACGTCTAGTCTCCAAAAACGTCAAAGTC 

ACAATCGCCACCACTACCTACACCGCCTCCTCAATCACAACACCATCACTCTCCGTCGAA 

CCAATCTCCGATGGATTCQATTTCATCCCCATAGGTATCCCCGGTTTCAGCGTCGATACT 

TACTCAGAATCCTTCAAGCTCAACGGATCCGAAACCCTAACTCTCCTAATCGAGAAATTC 

AAATCCACAGATTCACCAATCGATTGCTTAATCTACGATTCGTTTCTTCCTTGGGGACTT 

GAAGTTGCTAGATCTATGGAACTTTCAGCTGCTTCTTTCTTCACTAATAATCTCACTGTT 

TGTTCTGTGTTGCGTAAATTCTCTAACGGTGACTTTCCTCTTCCCGCTGATCCTAATTCG 

GCGCCGTTTCGTATCCGTGGCTTACCGTCTTTGAGCTACGATGAGTTACCTTCGTTTGTG 

GGACGTCATTGGTTGACTCATCCTGAGCATGGCAGAGTTCTTCTGAATCAGTTTCCTAAC 

CATGAAAATGCTGATTGGTTATTCGTTAATGGCTTTGAAGGGTTAGAAGAAACACAAGAT 

TGTGAAAATGGTGAGTCTGATGCAATGAAGGCGACGTTGATCGGACCGATGATTCCATCG 

GCTTATCTTGATGATCGGATGGAAGATGATAAAGACTATGGTGCGAGTCTGTTGAAACCG 

ATATCGAAGGAGTGTATGGAGTGGCTTGAGACTAAGCAGGCTCAGTCAGTAGCATTTGTT 

TCGTTTGGTTCGTTTGGGATTCTCTTTGAGAAGCAACTTGCAGAGGTAGCTATTGCGCTA 

CAAGAATCGGATTTGAACTTCTTGTGGGTGATTAAAGAAGGTCATATAGCGAAATTGCC* 

GAAGGGTTTGTGGAATCGACTAAAGATAGAGCCTTGTTGGTTTCTTGGTGTAACCAGCTT 

GAGGTTTTAGCTCATGAATCGATAGGTTGCTTTTTGACTCATTGTGGTTGGAACTCTACG 

TTGGAAGGGTTGAGTTTGGGAGTTCCGATGGTTGGTGTGCCTCAGTGGAGTGATCAGATG 

AATGATGCTAAGTTTGTGGAGGAAGTTTGGAAAGTTGGGTATAGAGCGAAAGAGGAAGCT 

GGGGAAGTAATCGTGAAGAGTGAAGAATTGGTGAGGTGTTTGAAAGGAGTGATGGAAGGA 

GAGAGTAGTGTGAAGATTAGAGAGAGTTCGAAGAAGTGGAAAGATTTGGCTGTGAAGGCA 

ATGAGTGAAGGAGGAAGCTCTGATCGAAGCATTAACGAGTTTATAGAGAGTTTAGGGAAG

TAA



>UGT74C1

ATGAGTGAAGCAAAGAAGGGTCACGTACTGTTTTTTCCATATCCATTACAAGGCCACATT 

AACCCAATGATCCAACTCGCTAAACGCTTATCCAAAAAGGGCATCACCAGCACACTCATC 

ATCGCCTCCAAAGACCACCGTGAACCTTACACCTCCGACGACTACTCCATCACCGTCCAC

ACCATCCACGACGGTTTCTTTCCACATGAACACCCTCACGCCAAGTTCGTAGATCTTGAC

CGTTTCCACAACTCTACTTCTCGAAGCCTGACCGATTTCATCTCTAGTGCGAAGTTGTCG 

GACAATCCTCCAAAAGCTCTGATCTATGATCCATTTATGCCCTTTGCATTGGACATAGdC 

AAGGACTTGGATCTATACGTAGTGGCATATTTCACTCAACCATGGTTGGCTAGTCTTGTT

TACTACCATATCAACGAAGGCACCTACGATGTTCCCGTTGATAGACACGAGAACCCAACA 

CTTGCATCGTTTCCTGGTTTCCCATTGTTAAGCCAAGATGATCTGCCTTCGTTCGCCTGC 
GAAAAAGGGTCGTACCCTCTTCTACACGAGTTTGTGGTTAGGCAATTCTCTAATTTATTG

CAAGCTGATTGCATTCTCTGCAACACTTTTGATCAACTTGAACCAAAGGTAGTG 

ATGAATGATCAATGGCCGGTGAAGAACATTGGACCGGTGGTTCCATCGAAGTTCTTGGAT 

AACCGGTTGCCAGAAGACAAAGATTACGAACTCGAGAACTCCAAGACAGAGCCAGACGAG 

TCTGTTTTGAAGTGGTTGGGAAACAGGCCGGCGAAGTCGGTGGTTTACGTGGCGTTTGGG 

ACATTGGTGGCTTTGAGCGAAAAACAGATGAAGGAAATTGCAATGGCGATTAGCCAAACC 

GGATATCACTTCTTGTGGTCTGTTAGAGAATCCGAGAGAAGCAAACTACCCTCTGGTTTT 

ATCGAAGAGGCAGAGGAGAAAGACTCTGGACTTGTGGCTAAGTGGGTTCCTCAGCTAGAG 

GTTTTAGCACATGAATCAATCGGGTGTTTCGTGTCACACTGTGGATGGAACTCGACATTG 

GAGGCACTATGCTTAGGGGTTCCAATGGTGGGCGTGCCTCAGTGGACTGATCAGCCCACA 

AATGCTAAGTTTATAGAGGATGTGTGGAAGATTGGGGTTAGAGTGAGGACCGATGGAGAA 

GGGCTTTCGAGTAAAGAAGAGATTGCGAGATGCATTGTTGAGGTCATGGAAGGAGAGAGA 

GGGAAAGAGATAAGGAAGAATGTTGAGAAGCTTAAGGTGTTGGCTCGCGAAGCTATCTCT 

GAAGGAGGTAGTTCCGACAAGAAGATTGATGAGTTTGTTGCTCTTTTGACTTAA 



>UGT74D1

ATGGGAGAGAAAGCGAAAGCAAATGTGTTAGTCTTCTCATTTCCGATACAAGGTCACATA 

AACCCTCTCCTCCAATTCTCAAAACGCCTACTCTCTAAAAACGTCAACGTCACATTCCTC 

ACCACTTCCTCCACCCACAACTCCATCCTCCGCCGTGCCATCACCGGCGGAGCCACTGCT 

CTTCCTCTCTCTTTTGTCCCCATTGACGATGGATTCGAGGAAGATCACCCATCTACGGAC 

ACATCTCCCGACTACTTCGCAAAGTTCCAAGAAZUICGTATCTCGAAGCCTCT^ 

ATCTCCTCGATGGACCCAAAACCAAACGCCGTCGTTTACGACTCGTGCCTGCCTTATGTC 

CTCGACGTTTGCCGGAAACATCCTGGCGTTGCTGCGGCGTCGTTTTTCACTCAGTCCTCC 

ACCGTGAACGCGACCTATATTCATTTCTTGCGTGGAGAGTTTAAGGAGTTTCAAAATGAT 

GTCGTTTTGCCTGCAATGCCTCCGCTGAAGGGTAATGACTTACCGGTGTTTCTGTACGAT 

AACAATCTCTGCCGGCCGTTGTTTGAGCTCATTAGTAGCCAGTTCGTGAATGTT^^ 

ATTGACTTCTTCTTGGTTAACTCTTTCGACGAACTCGAAGTCGAGGTGCTACAATGGATG 

AAAAACCAATGGCCGGTCAAGAACATAGGACCGATGATTCCATCAATGTACTTAGACAAA 

CGATTAGCAGGTGACAAAGACTACGGAATCAACCTCTTCAATGCCCAAGTCAACGAATGC 

CTTGATTGGCTTGACTCAAAACCGCCCGGTTCAGTGATCTACGTGTCTTTTGGAAGCTTG 

GCCGTCTTAAAAGACGATCAAATGATAGAAGTCGCGGCTGGTCTAAAACAAACTGGCCAT 

AACTTCTTATGGGTTGTTAGAGAAACTGAAACAAAGAAGCTTCCAAGCAATTACATAGAG 

GACATTTGTGACAAGGGATTGATAGTGAATTGGAGTCCTCAATTACAAGTTCTTGCACAT 

AAATCAATCGGTTGTTTCATGACTCATTGCGGGTGGAATTCGACTTTAGAGGCATTGAGC

TTAGGAGTTGCTTTGATAGGAATGCCGGCTTATAGCGACCAGCCGACTAATGCTAAGTTT 

ATTGAAGATGTGTGGAAGGTTGGGGTTAGGGTTAAGGCAGATCAAAATGGGTTTGTTCCG 

jyVGGAAGAGATTGTGAGATGTGTTGGAGAAGTTATGGAAGATATGTCGGAGAA^ 

GAGATTAGAAA?yvLATGCTCGGAGGTTGATGGAGTTTGCAAGGGAAGCTTTGTCTGATGGA 

ggaaattctgataagaatattgatgagtttgttgctaaaattgtgaggtaa



>UGT74E1

atgagagaaggatctcatgttattgttttgcctttcccagcacaaggccacataactcca 
atgtcccaattctgtaaacgcttagcctc?^aaaagtcttaagatcactct^ 
tccgacaagccctctccgccgtacaaaacagagcacgacacaatcactgtcgtccccatc 
tccaatggtttccaagaaggccaggaacgatcagaagacctagatgagtacatggaaaga 

GTAGAATCCAGCATCAAAAACCGCTTACCGAAGTTGATAGAAGACATGAAACTATCGGGA 

aatcctcctagggctcttgtgtacgactccaccatgccgtggcttctggatgtagctcat 

AGTTATGGTTTGAGCGGTGCCGTGTTTTTCACGCAGCCTTGGCTTGTCTCAGCTATTTAC 

tatcatgtattcaagggctcgttctctgtaccgtctacaaagtatggtcactcgacgtta 

GCATCTTTCCCTTCGTTACCGATTCTGAATGCGAATGATTTGCCGTCTTTCCTCTGTGAA 

tcttcctcttacccatatattctaaggactgtgatcgatcagctctcaaacattgatcga 
gttgatatagttttgtgcaacactttcgataaattggaagaaaagttggtgaaatggatt 
AAAAGCGTGTGGCCTGTCCTGAACATAGGACCAACTGTTCCATCAATGTATTTAGATAAG
CGACTGGCTGAAGACAAAAACTACGGATTCAGCCTCTTCGGTGCGAAAATCGCTGAATGC
ATGGAGTGGCTCAACTCAAAGCAGCCTAGTTCAGTTGTTTATGTATCATTTGGGAGCTTG 
GTGGTTCTAAAA7^GATCAACTGATAGAA.CTAGCGGCGGGTCTGAAACAGAGCGGACAT 
TTCTTTTTGTGGGTTGTGAGAGAGACGGAGAGAAGAAAACTTCCAGAAAACTATATAGAG 
GAAATTGGTGAGAAAGGACTGACCGTGAGCTGGAGTCCACAACTTGAAGTTCTTACACAT 
AAATCGATCGGTTGTTTCGTGACACATTGTGGATGGAACTCGACGTTAGAGGGATTGAGT 
TTGGGAGTTCCAATGATTGGTATGCCTCATTGGGCAGATCAGCCTACAAATGCTAAGTTC
ATGGAGGATGTGTGGAAAGTTGGAGTTAGGGTTAAAGCAGACAGTGATGGGTTCGTGAGA 
AGAGAAGAGTTTGTGAGACGTGTGGAAGAAGTTATGGAGGCAGAGCAAGGTAAAGAGATT 
AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCTGTTTCTGAAGGAGGTAGT 
TCTGATAAGAACATCa^TGAGTTTGTTTCTATGTTTTGTTGA 



>UGT74E2

ATGAGAGAAGGATCTCATCTTATCGTCTTGCCTTTCCCAGGACAAGGCCACATAACTCCA 
ATGTCCCAGTTCTGCAAACGCTTAGCCTCAAAAGGTCTTAAGCTCACTCTGGTCCTCGTC 
TCCGACAAACCCTCTCCTCCATACAAAACAGAGCACGACTCAATCACTGTCTTCCCCATC 
TCCAACGGCTTCCAAGAAGGCGAGGAACCATTACAAGACCTCGATGATTACATGGAAAGA 
GTAGAAACCAGCATCAAAAACACCTTACCGAAGTTGGTTGAAGACATGAAACTGTCGGGA 
AATCCACCTAGGGCTATCGTGTACGACTCCACCATGCCATGGCTTCTTGATGTAGCTCAT 
AGTTATGGATTGAGCGGTGCCGTGTTTTTCACGCAACCTTGGCTTGTCACAGCTATTTAC 
TACCATGTTTTCAAGGGTTCGTTCTCTGTACCGTCTACAAAGTACGGTCACTCGACATTA 
GCATCTTTCCCTTCGTTCCCGATGCTGACTGCAAATGATTTGCCGTCTTTCCTCTGCGAA 
TCGTCCTCATACCCGAATATACTGAGGATTGTGGTGGATCAGCTCTCAAACATTGATCGA 
GTCGACATAGTGTTGTGCAACACTTTCGATAAATTGGAGGAAAAGTTGTTGAAATGGGTC 
CAAAGCTTGTGGCCAGTCTTGAATATTGGACCAACGGTTCCATCGATGTATTTAGACAAA 
CGACTGTCTGAAGACAAGAACTACGGTTTTAGCCTCTTCAATGCGAAAGTCGCTGAATGC 
ATGGAGTGGCTAAACTCAAAGGAGCCTAATTCTGTTGTCTATTTATCATTCGGAAGTTTG 
GTGATTCTAAAAGAAGATCAAATGTTGGAACTCGCTGCGGGTCTGAAACAGAGCGGACGT 
TTCTTTCTGTGGGTTGTGAGAGAGACAGAGACACACAAACTTCCAAGAAACTATGTCGAG 
GAAATCGGTGAAAAAGGACTTATTGTAAGCTGGAGTCCTCAGCTTGACGTACTTGCACAT 
AAATCAATCGGTTGTTTCTTGACACACTGTGGATGGAACTCGACGTTAGAGGGATTGAGT 
TTGGGAGTTCCAATGATTGGTATGCCACACTGGACTGATCAGCCeACGAATGCTAAGTTC 
ATGCAGGATGTGTGGAAGGTTGGGGTAAGGGTTAAGGCAGAAGGTGATGGGTTTGTGAGA 
AGAGAAGAGATTATGAGAAGTGTGGAAGAAGTTATGGAGGGAGAGAAAGGGAAAGAGATT 
AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCAGTTTCTGAAGGAGGTAGC 
TCTGATAAGAGCATCAATGAGTTTGTTTCTATGTTTTGTTGA 



>UGT74F1

ATGGAGAAGATGAGAGGACATGTATTAGCAGTGCCATTTCCAAGCCAAGGACACATCACC
CCGATTCGCCAATTCTGCAAACGACTTCACTCCAAAGGTTTCAAAACCACTCACACTCTC 
ACCACTTTTATCTTCAACACAATCCACCTCGACCCATCTAGTCCTATCTCCATAGCCACA 
ATCTCCGATGGCTATGACCAGGGAGGGTTCTCATCAGCCGGTTCTGTCCCGGAGTACCTA 
CAAAACTTCAAAACCTTCGGCTCCAAAACCGTCGCTGATATCATCCGCAAACACCAGAGT 
ACTGATAACCCTATTACTTGTATCGTCTATGATTCTTTCATGCCTTGGGCGCTTGACCTT 
GCAATGGATTTTGGTCTAGCTGCGGCTCCTTTCTTCACGCAGTCTTGCGCCGTTAACTAT 
ATCAATTATCTTTCTTACATAAACAATGGTAGCTTGACACTTCCCATCAAGGATTTGCCT 
CTTCTTGAGCTCCAAGATTTGCCTACTTTCGTCACTCCTACTGGTTCACACCTTGCTTAC 
TTTGAGATGGTGCTTCAACAGTTCACCAACTTCGACAAAGCTGATTTCGTACTCGTTAAT 
TCCTTCCATGACCTCGACCTTCATGAAGAGGAGTTGTTGTCGAAAGTATGTCCTGTGTTG 
ACAATTGGTCCAACTGTTCCATCAATGTACTTAGACCAACAGATCAAATCAGACAACGAC 
TATGATCTGAACCTCTTTGACTTAAAAGAAGCTGCCTTATGCACTGACTGGCTAGACAAG 
AGGCCAGAAGGATCGGTAGTATATATAGCTTTTGGGAGCATGGCTA7ACTGAGTAGTGAG

CAGATGGAAGAGATTGCTTCGGCGATAAGCAACTTCAGCTACCTCTGGGTTGTCAGAGCT 

TCAGAGGAGTCAAAGCTCCCACCAGGGTTTCTTGAAACAGTGGATAAAGACAAGAGCTTG 

GTCTTGAAGTGGAGTCCTCAGCTTCAAGTTCTGTCAAACAAAGCCATCGGTTGTTTCATG 

ACTCACTGTGGCTGGAACTCAACCATGGAGGGTTTGAGTTTAGGGGTTCCCATGGTGGCT 

ATGCCTCAATGGACTGATCAACCAATGAATGCAAAGTATATACAAGATGTATGGJ^ 

GGGGTTCGTGTGAAAGCAGAGAAAGAAAGTGGCATTTGCAAAAGAGAGGAGATTGAGTTT 

AGCATCAAGGAAGTGATGGAAGGAGAGAAGAG(aAAGAGATGAAAGAGAATGCGG^ 

TGGAGAGACTTGGCTGTGAAGTCACTCAGTGAAGGAGGTTCTACAGATATCAACATTAAC 

GAATTTGTATCAAAAATTCAAATCAAATAA



>UGT74F2 

ATGGAGCATAAGAGAGGACATGTATTAGCAGTGCCGTACCCAACGCAAGGACACATCA^^ 
CCATTCCGCCT^TTCTGCWy^CGACTTCACTTCAAAGGTCTC^^ 
ACCACTTTCGTCTTCAACTCCATCAATCCTGACCTATCCGGTCCAATCTCCATAGCCACC 
ATCTCCGATGGCTATGACCATGGGGGTTTCGAGACAGCTGACTCCATCGACGACTACCTC 
AAAGACTTTAAAACTTCCGGCTCGAAAACCATTGCAGACATCATCCAAAAACACCAGACT 
AGTGATAACCCCATCACTTGTATCGTCTATGATGCTTTCCTGCCTTGGGCACTTGACGTT 
GCTAGAGAGTTTGGTTTAGTTGCGACTCCTTTCTTTACGCAGCCTTGTGCTGTTAACTAT 
GTTTATTATCTTTCTTACATAAACAATGGAAGCTTGCAACTTCCCATTGAGGAATTGCCT 
TTTCTTGAGCTCCAAGATTTGCCTTCTTTCTTCTCTGTTTCTGGCTCTTATCCTGCTTAC 
TTTGAGATGGTGCTTCAACAGTTCATAAATTTCGAAAAAGCTGATTTQGTTCTCGTTAAT 
AGCTTCGAAGAGTTGGAACTGCATGAGAATGAATTGTGGTCGAAAGCTTGTCCTGTGTTG
ACAATTGGTCCAACTATTCCATCAATTTACTTAGACCAACGTATCAAATCAGACACCGGC 
TATGATCTTAATCTCTTTGAATCGAAAGATGATTCCTTCTGCATTAACTGGCTCGACACA 
AGGCCACAAGGGTCGGTGGTGTACGTAGCATTCGGAAGCATGGCTCAGCTGACTAATGTG 
CAGATGGAGGAGCTTGCTTCAGCAGTAAGCAACTTCAGCTTCCTGTGGGTGGTC^^
TCAGAGGAGGAAAAACTCCCATCAGGGTTTCTTGAGACAGTGAATAAAGAAAAGAGCTTG
GTCTTGAAATGGAGTCCTCAGCTTCAAGTTGTGTCAAACAAAGCCATCGGTTGTTTCTTG 
ACTCACTGTGGCTGGAACTCAACCATGGAGGCTTTGACCTTCGGGGTTCCCATGGTGGCA
ATGCCCCAATGGACTGATCAACCGATGAACGCMAGTACATACAAGATGTGTGGAAGGCT
GGAGTTCGTGTGAAGACAGAGAAGGAGAGTGGGATTGCCAAGAGAGAGGAGATTGAGTTT 
AGCATTAAGGAAGTGATGGAAGGAGAGAGGAGCAAAGAGATGAAGAAGAACGTGAAGAAA 
TGGAGAGACTTGGCTGTCAAGTCACTCAATGAAGGAGGTTCTACGGATACTAACATTGAT 
ACATTTGTATCAAGGGTTCAGAGCAAATAG 



>UGT75B1

ATGGCGCCACCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT 
CTCCGTTTTGCTCGTCGGCTCATCAAAAGAACCGGCGCACGTGTCACTTTCGTCACTTGT 
GTCTCCGTCTTCCACAACTCCATGATCGCAAACCACAACAi\AGTCGAAAATCTCTCTTTC 
CTTACTTTGTCCGACGGTTTCGACGATGGAGGCATTTCCACCTACGAAGACCGTCAGAAA 
AGGTCGGTGAATCTCAAGGTTAACGGCGATAAGGCACTATCGGATTTCATCGAAGCTACT 
AAGAATGGTGACTCTCCCGTGACTTGCTTGATCTACACGATTCTTCTCAATTGGGCTCCA 
AAAGTAGCACGTAGATTTCAACTTCCCTCCGCTCTTCTCTGGATCCAACCGGCTTTGGTT 
TTCAACATCTATTACACTCATTTCATGGGAAACAAGTCCGTTTTCGAGTTACCTAATCTG 
TCTTCTCTGGAAATCAGAGATCTTCCATCTTTCCTCACACCTTCCAACACAAACAAAGGC 
GCATACGATGCGTTTCAAGAAATGATGGAGTTTCTCATAAAAGAAACCAAACCGAAAATT
CTCATCAACACTTTCGATTCGCTGGAACCAGAGGCCTTAACGGCTTTCCCGAATATCGAT 
ATGGTGGCGGTTGGTCCTTTACTTCCCACGGAGATTTTCTCAGGAAGCACCAACAAATCA
GTTAAAGATCAAAGTAGTAGTTATACACTTTGGCTAGACTCGAAAACAGAGTCCTCTGTT 
ATTTACGTTTCCTTTGGAACAATGGTTGAGTTGTCCAAGAAACAGATAGAGGAACTAGCG 
AGAGCACTCATAGAAGGGAAACGACCGTTTTTGTGGGTTATAACTGATAAATCCAACAGA 
GAAACGAAAACAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGATTCAGACAC

GAGCTTGAAGAGGTTGGGATGATTGTGTCGTGGTGTTCGCAGATAGAGGTTTTAAGTCAC 

CGAGCCGTAGGTTGTTTTGTGACTCATTGTGGGTGGAGCTCGACGCTGGAGAGTTTGGTT 

CTTGGCGTTCCGGTTGTGGCGTTTCCGATGTGGTCGGATCAACCGACGAACGCGAAGCTA 

CTGGAAGAAAGTTGGAAGACTGGTGTGAGGGTAAGAGAGAACAAGGATGGTTTGGTGGAG 

AGAGGAGAGATCAGGAGGTGTTTGGAAGCCGTGATGGAGGAGAAGTCGGTGGAGTTGAGG 

GAAAACGCAAAGAAATGGAAGCGTTTAGCGATGGAAGCGGGTAGAGAAGGAGGATCTTCG 

GATAAGAACATGGAGGCTTTTGTGGAGGATATTTGTGGAGAATCTCTTATTCftA^ 

TGTGAAGCAGAGGAGGTAAAAGTAAAGTAA



>UGT75B2 

ATGGCGCAAGCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT 

CTCCGTTTTGCTCGTCGGCTCATCAAAACAACTGGCGCACGTGTAACTTTCGQCACGTGT 
CTCTCTGTCATTCACCGCTCTATGATCCCAAACCACAACAACGTCGAAAATCTCTCTTTC 
CTTACTTTCTCCGACGGATTCGACGACGGAGTCATCTCCAACACCGACGACGTCCAAAAC 
CGGTTGGTACACTTCGAACGTAATGGCGATAAAGCTCTATCGGATTTCATCGAAGCTAAT 
CAGAATGGTGACTCTCCCGTAAGTTGCTTGATCTACACGATTCTTCCCAACTGGGTTCCA
AAAGTGGCGCGTAGATTTCATCTTCCCTCTGTTCATCTCTGGATCCAACCAGCCTTCGCT 
TTCGACATTTATTACAATTACTCTACT^GGAAACAACTCCGTTTTCGAGTTCCCGAATCT^ 
CCTTCTCTCGAAATCCGCGATCTGCCTTCTTTCCTCTCACCTTCCAACACGAACAAAGCC 
GCACAAGCAGTATATCAAGAACTGATGGATTTTCTCAAAGAAGAATCTAACCCGAA?^TT 
CTCGTCAACACATTCGATTCGCTGGAGCCAGAGTTCTTAACAGCTATTCCGAATATAGAA
ATGGTGGCAGTTGGTCCTTTACTTCCTGCGGAGATTTTCACTGGAAGCGAATGAGGTAAA 
GATTTATCAAGAGATCATCAZ^GTAGTAGTTATACACTTTGGTTAGACTCGAAAACAGAG 
TCCTCTGTTATTTATGTTTCTTTTGGAACAATGGTTGAGTTGTCGAAGAAACAGATAGAG
GAACTAGCGAGAGCACTCATAGAAGGGGGAAGACCGTTCTTGTGGGTTATAACTGATAAA 
CTCAACAGAGAAGCGAAAATAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGT 
TTTAGACACGAGCTTGAAGAGGTTGGGATGATTGTCTCGTGGTGTTCGCAGATAGAGGTT 
TTGAGACACCGAGCCATAGGTTGTTTTTTGACTCATTGTGGGTGGAGCTCATCACTGGAG 
AGTTTGGTTCTCGGCGTTCCAGTGGTGGCGTTTCCGATGTGGTCGGATCAGCCAGCAAAT 
GCGAAGCTTTTGGAAGAAATATGGAAGACAGGTGTGAGGGTGAGAGAGAACTCGGAAGGT 
TTAGTAGAGAGAGGAGAGATAATGCGGTGTTTGGAAGCAGTGATGGAGGCGAAATCGGTG 
GAGCTGAGGGAZyiACGCAGAGAAATGGAAGCGTTTAGCGACTGAAGCGGGTAGAGAAGGA 
GGATCTTCGGACAAGAATGTGGAAGCTTTTGTGAAGAGTCTGTTTTGA 



>UGT75C1

atggccacttccgtcaatggttcccatcgtcgtccacattacttgcttgtaacattccca 

gcgcaaggtcacatcaacccggcgcttcaactagccaaccgcctcatccaccacg^^ 

accgtcacatactccaccgcagtctctgctcaccgacgtatgggcgagccaccttccaca 

aaaggtctatccttcgcttggttcaccgatggattcgacgacggtctcaagtcattcgaa 

gac(::agaaaatctaca.tgtccgaactcaaacgatgtggttcaaa.cgccctgagagac^^ 

atcaaagcci?^tcttgacgccacc:accgaaacaga 

gttctcgtcccgtgggtttctacggtagcgcgtgagtttcacctcccaactacacttctc 
tggattgaaccagctactgtactagacatctactactactacttcaacacctcttacaaa 
catctcttcgacgttgaaccgattaaattaccgaaactgccactgatcaccaccggtgac 
ctcccgtcgtttcttcaaccttcgaaggcattaccgtcagctcttgtgactctaagagaa 
catatcgaagctctcgaaacggaatcaaaccctaagattcttgttaacacattctctgct 
ttggaacacgatgctttaacctctgttgagaaactcaagatgatcccaatcggaccgttg 
gtttcttcctccgagggtaaaaccgatcttttcaaatcttccgacgaggattacacgaaa 
tggttagactcgaagctcgagagatcagtgatttacatttccttaggcacacacgccgat 
gatttaccagagaaacacatggaagcgcttactcacggcgtgttagctacaaacagaccg 
tttttatggatcgtgagggagaaaaatccagaagagaagaagaagaatcggtttcttgaa 
TTGATCAGAGGAAGTGATCGAGGATTGGTGGTGGGATGGTGTTCTCAGACAGCTGTTTTG
GCGCATTGTGCTGTGGGATGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTGGAGAGT 
TTAGAGAGTGGTGTTCCGGTGGTTGCGTTTCCGCAGTTTGCTGATCAGTGTACAACGGCG 
AAGCTTGTGGAGGATACGTGGAGGATTGGAGTGAAGGTGAAGGTTGGGGAGGAAGGAGAT 
GTGGATGGGGAGGAGATTAGAAGGTGTTTGGAGAAGGTGATGAGTGGTGGAGAAGAGGCG 
GAGGAGATGAGAGAGAATGCAGAGAAGTGGAAGGCGATGGCTGTTGATGCGGCAGCGGAA 
GGTGGACCGTCGGATTTGAATCTTAAAGGTTTTGTGGACGAGGATGAGTAG 



>UGT75D1

ATGGCCAACAACAATTCCAACTCTCCCACCGGTCCACACTTTCTATTCGTAACATTTCC^ 

GCCCAAGGTCACATCAACCCATCTCTCGAGCTAGCCAAACGCCTCGCCGGAAC^ 

GGTGCTCGAGTCACCTTCGCCGCCTCAATCTCTGCCTACAACCGCCGCATGTTCTCTACA 

GAAAACGTCCCCGAAACCCTAATCTTCGCTACCTACTCCGATGGCCACGACGACGGTTTC 

AAATCCTCTGCTTACTCCGACAAATCTCGTCAAGACGCCACTGGAAACTTCATGTCTGAG

AT6A(3ACGACGTGGCAAAGAGACa.CTAACCGAACTAATCGAAGATAACCGGAAAC^^ 

AGGCCTTTTACTTGCGTGGTTTACACGATTCTCCTCACTTGGGTCGCTGAGCTAGCGCGT 

GAGTTTCATCTTCCTTCTGCTCTTCTTTGGGTCCAACCAGTAACAGTCTTCTCCATTTTT 

TACCATTACTTCAATGGCTACGAAGATGCAATCTCAGAGATGGCTAATACCCCCTCTAGT 

TCTATTAAATTACCTTCTCTGCC?ICTGCTTACTGTCCGTGATATTCCTTCTTTCATT6TC 

TCTTCCAATGTCTACGCGTTTCTTCTACCCGCGTTTCGAGAACAGATTGATTCACTGAAG 

GAAGAAATi^CCCTAAGATCCTCATCAAOlCTTTCCAAGAGCTTGAGCCAG^ 

AGCTCGGTTCCAGATAATTTCAAGATTGTCCCTGTCGGTCCGTTACTAACGTTGAGAACG 

GATTTTTCGAGTCGCGGTGAATACATAGAGTGGTTGGATACTAAAGCC5GATTCGTCTGTG 

CTTTATGTTTCGTTCGGGACGCTTGCCGTGTTGAGCAAGAAACAGCTTGTGGAGCTTTGT 

AAAGCGTTGATACA?y^GTCGGAGACCATTCTTGTGGGTGATTACGGATAAGTCGTACAGA 

AATAAAGAAGATGAGCAAGAGAAGGAAGAAGATTGCATAAGTAGTTTCAGAGAAGAGCTC 

GATGAGATAGGiyVTGGTGGTTTCATGGTGTGATCAGTTTAGGGTTTTGAATCATAGATCG 

ATAGGTTGTTTCGTGACGCATTGCGGGTGGAACTCTACGCTGGAGAGCTTGGTTTCAGGA 

GTTCCGGTGGTGGCGTTTCCGCAATGGAATGATCAGATGATGAACGCGAAGCTTTTAGAA 

GATTGTTGGAAAACAGGTGTAAGAGTGATGGAGAAGAAGGAAGAAGAAGGAGTTGTGGTG

GTGGATAGTGAGGAGATACGGCGGTGCATTGAGGAAGTTATGGAAGACAAGGCGGAGGAG 

TTTAGAGGAAATGCCACGAGGTGGAAGGATTTAGCGGCGGAGGCTGTQAGAGAAGGAGGC

TCTTCCTTTAATCATCTCAAAGCTTTTGTCGATGAGCACATGTGA 



>UGT76B1

ATGGAGACTAGAGAAACAAAACCAGTGATCTTTCTCTTCCCTTTCCCTTTACAAGGTCAC 
TTAAACCCAATGTTTCAGCTCGCCAACATCTTCTTCAACAGAGGCTTCTCCATCACTGTG 
ATCCACACTGAGTTCAACTCTCCAAACTCTTCCAATTTCCCTCATTTCACTTTCGTATCC
ATCCCCGATAGCTTGTCTGAACCTGAATCCTATCCCGATGTCATCGAGATTCTCCATGAC 
CTCAATTCCAAGTGTGTTGCTCCTTTTGGTGATTGCTTAAAGAAGCTTATATCTGAAGAA 
CCAACAGCAGCTTGTGTGATTGTTGACGCTCTTTGGTACTTCACTCACGATTTAACCGAG 
AAATTCAATTTCCCGAGGATTGTTCTCCGAACCGTTAACCTCTCAGCTTTCGTCGCTTTC 
TCAAAGTTTCATGTTTTACGAGAGAAAGGGTATCTTTCTTTACAAGAGACTAAGGCAGAC 
Ta^CCGGTTCCGGAGCTTCCGTATCTTAGAATGAAGGATCTTCCATGGTTCCAGACAGAA 
GATCCAAGATCAGGGGATAAGTTACAGATAGGTGTGATGAAGTCACTAAAGTCTTCCTCA 
GGAATCATATTCAACGCCATTGAAGATCTTGAAACAGATCAGCTTGATGAAGCCCGCATA 
GAATTCCCAGTTCCACTCTTCTGTATTGGACCCTTTCACAGGTACGTTTCAGCTTCATCC 
AGTAGCTTACTTGCACACGACATGACTTGTCTCTCCTGGTTAGACAAGCAAGCAACAAAT 
TCCGTAATCTACGCAAGTCTTGGAAGCATTGCTTCGATCGATGAATCTGAATTCTTGGAG 
ATTGCTTGGGGTCTAAiSAJ^CAGCAACCAACCTTTTCTATGGGTGGTTAGACCCGGTTTA 
ATCCACGGGAAAGAATGGATCGAGATTCTGCCTAAAGGGTTCATCGAAAATCTCGAGGGC 
CGGGGTAAAATAGTGAAATGGGCACCTCAGCCTGAAGTTTTAGCTCACCGTGCAACAGGC 
GGATTCTTAACACATTGTGGATGGAACTCAACACTTGAGGGCATATGTGAAGCTATACCA
ATGATATGCAGACCATCTTTTGGGGACCAGAGGGTGAATGCTAGATACATTAACGATGTT 
TGGAAGATCGGATTGCATTTGGAAAACAAGGTAGAGAGACTAGTGATCGAAAACGCGGTT 
AGAACACTAATGACGAGCTCGGAAGGGGAAGAGATCCGC7AGAGGATTATGCCCATGAAG 
GAAACTGTTGAACAATGCCTTAAGCTTGGAGGTTCATCATTTCGGAATCTCGAAAACTTA 
ATTGCTTATATATTGTCTTTCTAA 



>UGT76C1 

ATGGAGAAGAGAAACGAGAGACAAGTGATTCTTTTTCCTCTACCATTACAAGGTTGCATA 

T^CCCTATGCTTCAGCTAGCAAAGATCCTTTACTCi^GAGGTTTTTCGATCACCATCATC 

CAOVCGCGCTTCAACGCGCCCAAATCTTCAGACCATCCTCTCTTCACTTTCTTACa^ 

CGCGACGGCTTGTCTGAATCTCAGACTCAATCTCGTGATCTTTTGCTTCAACTCACGCTT 

CTCAACAACAATTGTCAGATCCCATTTCGAGAGTGTTTGGCTAAACTCATTAAACCTAGT

TCAGATTCAGGAACAGAGGATAGGAAAATTAGCTGTGTGATCGATGATTCCGGTTGGGTT

TTCACACAATCCGTGGCGGAGAGTTTTAATCTTCCTCGATTTGTCCTCTGTGCTTATAAG 

TTCTCTTTCTTTCTCGGACATTTTCTTGTTCCTCAGATTCGTCGTGAAGGGTTTCTTCCA 

GTACO^GATTCGGAGGCAGATGATCTAGTTCCTGAGTTTCCACCGCTTCGAAAGAAAGAT 

CTTTCGAGAATTATGGGAACCAGCGCTCAGAGTAAGCCTCTAGATGCTTACTTGCTTAAG 

ATACTCGACGCGACGAAGCCAGCTTCAGGGATTATAGTTATGTCCTGCAAAGAGCTTGAC 

CATGATTCACTTGCTGAGTCCAACAAAGTTTTCAGCATTCCGATATTTCCCATTGGCCCT 

TTTCACATTCATGACGTCCCAGCCTCGTCTAGCAGCTTGTTAGAACCGGACCAGAGTTGC 

ATTCCATGGTTAGATATGCGTGAAACGAGATCAGTAGTCTACGTGAGCTTAGGGAGCATT 

GCGAGTCTTAACGAGTCTGACTTCTTGGAGATTGCTTGTGGACTAAGAAACACCAACCAA 

TCCTTCTTGTGGGTTGTCCGGCCTGGTTCAGTCCATGGCAGAGATTGGATCGAATCATTA 

CCTTCAGGGTTCATGGAAAGTCTCGATGGTAAAGGAAAGATAGTGAGATGGGCACCGCAG 

CTAGACGTTCTTGCGCATAGAGCCACGGGAGGGTTTTTGACTCATAATGGATGG/y^CTCG 

ACATTAGAGAGTATATGCGAAGGAGTACCTATGATCTGCTTGCCTTGTAAGTGGGACCAA 

TTTGTAAACGCGAGATTCATAAGCGAAGTTTGGAGGGTTGGGATTCACTTGGAAGGTCGG 

ATAGAGCGAAGAGAAATCGAGAGAGCTGTTATAAGACTAATGGTTGAGTCGAAAGGAGAA 

GAGATTCGAGGTAGAATCAAAGTCTTGCGAGACGAAGTAAGAAGGTCAGTTAAACAAGGA

GGTTCGTCATATCGATCTTTAGATGAGTTGGTTGATCGTATATCAATCATCATCGAGCCA 

CTAGTGCCTACGTGA 



>UGT76C2

ATGGAGGAGAAGAGAAATGGTCTGCGTGTGATTCTCTTCCCTCTTCCATTACAAGGTTGC 

ATCAACCCTATGCTTCAGCTCGCCAACATCCTTCACGTAAGAGGCTTCTCCATTACCGTG 

ATCCy^CACGCGCTTCAACGCGCGAAAAGCTTO^GCCATCCTCTCTTCAC 

ATTCCTGATGGTTTGTCTGAAACGGAGATTCAAGATGGTGTTATGTCTTTGCTCGCGCAA 

ATCAACCTTAACGCTGAGTCTCCGTTTCGTGATTGCTTGCGTAAAGTGTTGCTGGAATCA 

AAAGAGTCACa^GAGGGTTACTTGTTTGATCGATGACTGTGGATGGCTCTTCACACAATCT 

GTTTCAGAGAGTTTGAAGCTTCCGAGGCTCGTTCTCTGTACTTTTAAAGCCACTTTCTTC 

AATGCTTATCCGAGTCTTCCACTTATCCGAACCAAGGGATATCTTCCAGTTTCAGAATCG 

GAAGCAGAGGACTCTGTTCCTGAGTTCCCGCCGCTTCAAAAGAGAGATCTTTCAAAGGTT 

TTCGGGGAGTTCGGAGAGAAACTCGATCCGTTCTTACATGCTGTAGTCGAAACGACAATA

AGATCTTCAGGGTTAATATACATGTCCTGCGAAGAGCTTGAGAAAGATTCGTTGACTCTT 

TCTAACGAAATTTTTAAAGTTCCGGTTTTTGCAATTGGTCCGTTTCACAGCTACTTCTCT 

GCTTCGTCAAGCAGCTTGTTCACACAAGACGAGACTTGCATTCTGTGGTTAGATGATCAA 

GAAGATAAATCTGTGATCTACGTTAGTCTAGGAAGCGTTGTGAACATAACGGAAACAGAG 

TTCTTGGAGATTGCGTGTGGTTTAAGCAATAGCAAACAGCCTTTCTTGTGGGTAGTACGA 

CCCGGTTCAGTACTCGGCGCGAAATGGATCGAACCGCTCTCTGAAGGGCTGGTTAGTAGC

CTTGAAGAGAAAGGAAAGATTGTGAAATGGGCACCACAACAGGAGGTTCTTGCGCATCGT 

GCCACAGGAGGGTTTTTGACACACAATGGTTGGAACTCAACGCTAGAGAGTATATGCGAA 
GGGGTTCCTATGATCTGCCTACCAGGAGGTTGGGATCAAATGCTGAATTCAAGATTTGTT
AGCGATATTTGGAAGATTGGAATTCACTTGGAAGGTCGGATTGAAAAAAAGGAGATTGAG 
AAAGCTGTGAGGGTGTTAATGGAGGAAAGTGAAGGAAATAAGATTCGTGAGAGAATGAAA 
GTTCTGAAAGATGAGGTCGAGAAATCGGTCAAACAAGGAGGCTCATCTTTTCAATCTATT 
GAGACTCTAGCTAATCATATACTATTGTTGTAA 



>UGT76C3

ATGGATAAGAGTAATGGCCTACGAGTGATTCTGTTTCO^CTTCCATTACAAGGATGCATC 

AACCCCATGATTCAGCTAGCGAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC 

CAO^CGCGCTTCAATGCGCCAA^AGCTTCAAACCACCCTCTGTTCACCT^ 

CCAGATGGCTTGTCTGAAACAGAGACAAGAACTCACGATATCACACTTCTCCTAACGCTT

CTCAACCGAAGCTGTGAGTCTCCATTTCGTGAATGTTTGACTAAACTTTTGCAGTCTGCA 

GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGATGATTCTGGATGG 

ATATTCACy^CT^GCCCGTTGCTCAGAGTTTCAATCTCCCGAGATTGGTCCTTAACACCTAC 

AAAGTCTCCTTCTTTCGGGACCATTTTGTTCTTCCTCAACTCCGTCGTGAAATGTATCTT 

CCATTACAAGATTCAGAftiCAAGGTGATGATCCAGTTGAGGAGTTTCCACCCCTTCGAAAG 

AAAGATCTTTTACAAATTCTTGATCAAGAATCGGAGCAACTAGACTCGTACTCCAATATG 

ATTTTGGAAACAAO^AAAGCGTCTTCAGGTCTTATATTTGTATCCACATGTGAAGAGTTG 

GACCAAGACTCACTGAGTCAAi3CACGTGAAGATTATCAAGTCCCAATCTTTACGATAGGA 

CCTTCTCATAGCTACTTCCCAGGCTCATCTAGTAGCTTGTTCACAGTGGACGAGACTTGC

ATTCCATGGTTAGACAAQCAAGAAGACAAATCCGTGATTTACGTGAGTTTTGGGAGCATC 

TCGACCATTGGCGAAGCAGAATTCATGGAGATTGCTTGGGCTCTAAGAAACAGCGACCAA 

CCGTTCTTGTGGGTCGTACGGGGTGGTTCGGTAGTCCATGGTGCAGAATGGATCGAACAG 

CTTCATGAGAAAGGAAAGATAGTGAATTGGGCCCCACAACAAGAGGTTCTAAAGCATCAA 

GCCATTGGAGGATTCTTGACACACAATGGTTGGAACTCGACGGTTGAGAGTGTTTTTGAA 

GGCGTCCCTATGATATGTATGCCTTTTGTATGGGACCAATTGCTTAATGCAAGATTTGTT 

AGTGATGTATGGATGGTTGGGCTGC7VTCTAGAG6GTCGGATTGAGAGGAATGTGATTGAG 

GGAATGATAAGASlGATTATTTTCGGAAACTGAAGGAAAAGCGATCCGAGAGAGGATGGAA 

ATTCTTAAGGAGAATGTAGGAAGATCCGTTAAACCAAAAGGTTCGGCGTATCGATCGTTA 

CAACATTTGATTGATTATATAACATATTTCTAG 



>UGT76C4

ATGGAGAAGAGTAATGGCCTGCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC 

aaccctatgattcagctcgccaagatcctccactcaagaggtttttcaatcactgtgatc 

cacacttgcttcaacgcgccaaaagcttcaagccatccactcttcaccttcatacagatc 

aUlGATGGCTTGTCTGAAACAGAGACAAGAACTCGCGACGTCyyyiCTTCTCATAACA^ 

ctcaaccaaaattgcqagtctccggttcgtgAatgtttgcgtaaactgttgcaatctgcc 
aaggaagagaaacagaggattagctgtttgatcaatgattctggttggatcttcactcaa 
cacttagccaagagtttgaatctcatgagattggcctttaatacctataagatctccttc 
tttcgaagccattttgttcttcctcagctccggcgtgaaatgtttcttccattacaagat 

TCAGAACAAGATGATCCAGTTGAGAAGTTTCCACCGCTTAGAAAGAAAGATCTTTTACGG 
ATTCTTGAAGCAGATTCGGTGCAGGGAC3ACTCGTACTCGGATATGATTTTGGAAAAGACA 
AAGGCGTCTTCAGGTCTTATATTCATGTCCTGTGAAGAGTTGGACCAAGACTCACTGAGT
CAATCACGTGAAGATTTTAAGGTTCCGATATTTGCGATAGGACCTTCTCATAGCCATTTT 
CCTGCTTCTTCTAGTAGCTTGTTCACACCGGACGAGACTTGCATCCCATGGTTAGACAGA 
CAAGAAGACAAATCCGTAATATACGTGAGTATTGGGAGCCTCGTGACCATCAACGAAACA 
GAGCTAATGGAGATTGCTTGGGGTCTAAGTAACAGCGACCAACCATTTTTATGGGTCGTC 
CGGGTTGGTTCAGTCAATGGCACGGAATGGATTGAAGCAATCCCGGAATATTTCATCAAA 
AGGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCACAACAAGAGGTTCTAAAGCAT 
CGAGCTATTGGAGGTTTCTTGACACATAATGGTTGGAACTCGACGGTTGAGAGTGTTTGT 
GAAGGCGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAATTGTTAAATGCAAGATTT 
GTTAGTGATGTATGGATGGTTGGGATACATCTCGAGGGTCGGATTGAGAGGGATGAGATC 
GAGAGAGCGATAAGGAGATTATTGTTGGAAACTGAAGGAGAAGCCATCCGAGAGAGGATA
a^CTTCTTAAGGAAAAAGTAGGAAGATCAGTTAAACAAAACGGTTCGGCATATO^ 
CTACAAAATTTGATTAATTATATATCATCTTTCTAG 



>UGT76C5 

ATGGAGAAGAGTAATGGCCTTCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC 

AACCCCATGATTCAGCTCGCCAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC 

(:a.CACGTGCTTCAACGCGCa^AAAGCTTCAAGCCATCCTCTCTTCACCTTCTT^ 

CCAGATGGCTTGTCCGAAACAGAGAAAAGAACTAACAATACCAAACTTCTCCTAACGCT 

CTCAACCGGAACTGTGAGTCTCCGTTTCGTGAATGTTTGAGTAAACTGTTGCAGTCTGCA 

GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGCTGATTCTGGATGG 

ATGTTCACACAACCCATTGCTCAGAGTTTGAAACTCCCAATATTGGTCCTCAGTGTGTTT 

ACAGTCTCCTTCTTTCGCTGCCAATTTGTTCTTCCTAAGCTTCGGCGTGAAGTGTATCTT 

CCACTTCAAGATTCAGAACAGGAGGATCTAGTTCAAGAGTTTCCGCCGCTTCGAAAGAAG 

GATATTGTACGTATTCTTGATGTAGAAACAGATATACTAGATCCATTCTTGGACAAAGTT 

CTACAAATGACAAAGGCGTCTTCAGGTCTTATATTCATGTCATGTGAAGAGTTGGACCAC 

GACTCAGTGAGTCAGGCACGTGAAGATTTCAAAATTCCTATCTTTGGGATTGGACCATCT 

CAGAGCCACTTTCCAGCTACCTCTAGTAGCTTGTCCACACCCGACGAGAC^ 

TGGTTAGACAAACAAGAAGACAAATCCGTGATTTACGTCAGTTACGGGAGCATCGTGACC 

ATCAGCGAATCAGATTTAATAGAGATTGCTTGGGGTCTAAGAAACAGCGACCAACCCTTC 

TTGTTGGTCGTACGGGTTGGTTCAGTCCGTGGCAGAGAATGGATCGAGACAATCCCGGAA 

GAGATCATGGAAAAGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCGCAACAAGAC 

GTTCTAAAGCATCGAGCCATTGGGGGATTCCTGACACATAATGGTTGGAGCTCGACTGTT 

GAGAGTGTTTGTGAAGCAGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAAATGCTA

AATGCAAGATTTGTTAGCGATGTATGGATGGTCGGGATAAACCTAGAGGATCGGGTTGAA 

AGGAATGAGATCGAGGGAGCGATAAGGAGATTATTGGTGGAACCTGAAGGAGAAGCCATC 

CGAGAGAGGATAGAAGATCTTAAGGAGAAAGTAGGACGATCGTTTO^CAAAACG 

GGATATCAATCGTTACAAAATTTGATTGATTATATATCATCTTTTTAG 



>UGT76D1

ATGGCAGAGATTCGCCAGAGAAGAGTGTTGATGGTCCCAGCACCGTTCCAAGGCCATTTA 

CCTTCGATGATGAATCTAGCGTCCTACCTTTCTTCCCAAGGCTTTTCy^TCACAATC 

AGAAACGAATTCAATTTCAAAGATATCTCCCATAATTTCCCTGGTATAAAATTCTTCACC 

ATCAAGGACGGCTTGTCAGAATCTGACGTGAAGTCTCTGGGTCTCCTTGAATTTGTCCTG 

GAGCTTAACTCTGTCTGTGAACCCCTATTGAAAGAGTTTCTAACCAACCATGATGATGTT 

GTTGACTTTATCATTTATGATGAATTTGTTTACTTCCCTCGACGTGTTGCGGAAGATATG 

AATCTGCCAAAGATGGTCTTTAGCCCTTCTTCCGCCGCTACCTCGATCAGCCGGTGTGTG 

CTTATGGAGAACCiy^TCAAATGGGTTACTTCCTCCACAAGACGCAAGATCTC^ 

GAAACGGTGCCAGAGTTTCATCCCTTTCGTTTCAAAGATCTGCCTTTTACAGCTTATGGA 

TCTATGGAGAGATTAATGATACTTTACGAGAATGTAAGCAATAGAGCCTCATCTTCTGGC 

ATAATACACAACTCTTCGGATTGCTTAGAGAACTCATTCATAACAACTGCACAAGAGAAA 

TGGGGAGTTCCGGTATACCCGGTTGGTCCACTCCATATGACCAATTCCGGAATGTCATGT 

CCAAGTTTATTTGAAGAAGAAAGAAAGTGTCTTGAATGGCTTGAGAAGCAAGAAACAAGC 

TCAGTGATCTACATJ\AGCATG6GGA6CTTGGCGATGACACAAGATATAGAGGCTGTGGAG 

ATGGCCATGGGATTTGTCCAGAGTAATCAACCCTTCTTGTGGGTGATCCGACCAGGCTCT 

ATAAACGGACAAGAATCTTTAGACTTCTTACCGGAACAGTTCAACCAAACGGTGACCGAT 

GGAAGAGGTTTTGTTGTGAAATGGGCCCCACAT^AAAGAGGTATTAAGGCATAGAGCAGTG 

GGAGGGTTTTGGAACCATGGTGGATGGAACTCGTGCTTGGAGAGCATAAGCAGTGGTGTA 

CCAATGATTTGTAGGCCGTATTCTGGTGATCAGAGGGTGAATACTCGACTTATGTCACAT 

GTTTGGCAAACCGCGTATGAGATCGAAGGTGAATTGGAAAGAGGAGCTGTTGAGATGGCC

GTGAGGAGGCTCATTGTGGATCAAGAAGGTCAGGAGATGAGAATGAGAGCCACCATATTG 

AAGGAAGAGGTTGAAGCCTCTGTCACAACCGAAGGCTCTTCTCACAATTCTTTAAACAAT 
TTGGTCCATGCAATAATGATGCAAATTGACGAACAMGA



>UGT76E1

ATGGAAGAACTAGGAGTGAAGAGAAGGATAGTATTGGTTCCAGTTCCAGCACAAGGTCAT 

GTAACTCCQATTATGCAACTCGGGAAGGCTCTTTACTCCAAGGGCTTCTCCATCACTGTT 

GTTCTCACACAGTATAATCGAGTTAGCTCATCCAAGGACTTCTCTGATTTTCATTTCCTC 

ACCATCCCAGGCAGCTTGACCGAGTCTGATCTCAAAAACCTTGGACCATTCAAGTTTCTC 

TTO^GCTCAATCaAATTTGCGAGGCAAGCTTa^GCAATGTATTGGTC^ 

GAGCAAGGTAATGATATCGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCCCAAGCT 

GCAGTTAAAGAGTTTCAACTTCCTAGCGTCCTCTTCAGCACGACAAGTGCTACTGCCTTT 

GTCTGTCGCTCTGTTTTGTCTAGAGTCAACGCAGAGTCATTCTTGCTTGACATGAAAGAT 

CCCAAAGTGTCAGACAAGGAATTTCCAGGGTTGCATCCGCTAAGGTACAAGGACCTGCCA 

ACTTCAGCATTTGGGCCATTAGAGAGTATACTCAAGGTTTACAGTGAGACTGTCAACATT 

CGAACAGCTTCGGCAGTTATCATCAACTa^ACAAGCTGTCTAGAGAGCTCATCTTT^^^ 

TGGTTACAAAAACAACTGCAAGTTCCAGTGTATCCTATAGGCCCACTTCACATTGCAGCT 

TCAGCGCCTTCTAGTTTACTTGAAGAGGACAGGAGTTGCCTTGAGTGGTTGAACAAGCAA 

AAAATAGGCTCAGTGATTTACATAAGTTTGGGAAGCTTGGCTCTAATGGAAACTAAAGAC 

ATGTTGGAGATGGCTTGGGGTTTACX3TAATAGCAACCAACCTTTCTTATGGGTGATCCGA 

CCGGGTTCTATTCCCGGCTCGGAATGGACAGAGTCTTTACCGGAGGAATTCAGTAGGTTG 

GTTTCAGAAAGAGGTTACATTGTGAAATGGGCACCACAGATAGAAGTTCTCAGACATCCT 

GOVGTGCMAGGGTTTTGGAGTCACTGCGGATGGAACTCGACCCTAiSAGAGCATCGGGGAA 

GGAGTTCCGATGATCTGTAGGCCTTTTACGGGAGATCAGAAAGTCAATGCGAGGTACTTA 

GAGAGAGTTTGGAGAATTGGGGTTCAATTGGAAGGAGAGCTGGATAAAGGAACAGTGGAG 

AGAGCTGTAGAGAGATTGATTATGGATGAAGAAGGAGCAGAAATGAGGAAGAGAGTTATC 

AACTTGAAAGAGAAGCTTCAAGCCTCTGTCAAGAGTAGAGGTTCCTCATTCAGCTCATTA 

GACAACTTTGTCAATTCCTTAAAAATGATGAATTTCATGTAG 



>UGT76E11

ATGGAGGAAAAGCCGGCGGGCAGAAGAGTAGTGTTGGTTGCAGTTCCAGCTCAAGGACAT 

ATCTCTCCAATAATGCAACTTGCAAAAACACTTCACTTGAAGGGTTTCTCAATCACAATC 

GCTCAGACAAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTGATTTTCAGTTTGTC 

ACCATTCCAGAAAQCTTACCAGAGTCTGATTTTGAGGATCTCGGGCCAATAGAGTTTCTG 

CATAAGCTCAACAAAGAGTGTCAGGTGAGCTTCAAAGACTGTTTGGGTCAGTTGTTGCTG 

GAACAAGGTAATGAGATAGCCTGTGTTGTCTACGACGAGTTCATGTACTTTGCTGAAGCT 

GCAGCCAAAGAGTTTAAGCTTCCAAACGTCATTTTCAGCACCACAAGTGCCACGGCTTTT 

GTTTGCCGCTCTGCATTCGACAAACTTTATGCAAACAGTATCCTGACTCCCTTGAAAGAA 

CCCaVAAGGACT^CAAAACGAGCTAGTGCCAGAGTTTCATCCCCTGAGAtGa^ 

CCGGTTTCACATTGGGCATCATTAGAAAGCATGATGGAGCTGTATAGGAATACAGTTGAC 

AAACGGACAGCTTCCTCGGTGATAATCAACACAGCGAGCTGTCTAGAGAGCTCATCTCTG 

TCTCGTCTGCAGO^CAGCTACAAATTCCAGTT^ATCCTATAGGCCCT 

GCATCAGCTTCTACGAGTCTTCTTGAAGAGAACAAGAGCTGTATTGAATGGTTGAACAAA 

CaAAAGAAAAACTCTGTGATATTCGTAAGCTTGGGAAGCTTAGCTTTGATGGAi^TCaAT 

GAGGTQATAGAAACTGCTTTGGGATTGGATAGTAGO^GCAACAGTTCTTGTGGGTCATT 

CGGCCAGGGTCAGTACGTGGTTCGGAATGGATAGAGAACTTGCCTAAGGAGTTTAGTAAG 

ATAATTTCGGGTCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTTTCTCAT 

CCTGCAGTAGGAGQATTTTGGAGCCATTGCGGATGGAACTCGACACTAGAGAGCATCGGG 

gaaggagttccaatgatttgcaagccgttttccagtgatcaaatggtgaatgcgagatac 
ttggagtgtgtatggaaaattgggattcaagttgagggtgatctagacagaggagcggtc 

GAGAGAGCTGTGAGGAGiSTTAATGGTGGAGGAAGAAGGGGAGGGGATGAGGAAGAGAGCT 
ATCAGTTTGAAAGAGCAACTTAGAGCCTCTGTTATAAGTGGAGGTTCTTCy^CACAACTCG 
CTAGAGGAGTTTGTACACTACATGAGGACTCTATGA 
>UGT76E12

ATGGAGGAAAAGCCTGCAAGGAGAAGCGTAGTGTTGGTTCCATTTCCAGCACAAGGACAT 

ATATCTCCAATGATGCAACTTGCCAAAACCCTTCACTTAAAGGGTTTCTCGATCACAGTT 

GTTCAGACTAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTCATGATTTTCAGTTC 

GTCACCATTCCAGAAAGCTTACCAGAGTCTGATTTCAAGAATCTCGGACCAATACAGTTT 

CTGTTTAAGCTCAACAAAGAGTGTAAGGTGAGCTTCAAGGACTGTTTGGGTCAGTTGGTG 

CTGCAACAAAGTAATGAGATCTCATGTGTCATCTACGATGAGTTCaiTGTACTTTGCTGA^ 

GCTGCAGCCAAAGAGTGTAAGCTTCCAAA(I?^TC7^TTTTCAGCACAACAAGTGCC^^ 

TTCGCTTGCCGCTCTGTATTTGACAAACTATATGCAAACAATGTCCAAGCTCCCTTGAAA 

GAAACTAAAGGAOVACAAGAAGAGCTAGTTCCGGAGTTTTATCCCTTGAGATATAAAGA^ 

TTTCCAGTTTCACGGTTTGCATCATTAGAGAGCATAATGGAGGTGTATAGGAATACAGTT 

GACAAACGGACAGCTTCCTCGGTGATAATCAACACTGCGAGCTGTCTAGAGAGCTCATCT 

CTGTCTTTTCTGCaLAC^CAACAGCTACAAATTCCAGTGTATCCT^ 

ATGGTGGCCTCAGCTCCTACAAGTCTGCTTGAAGAGAACAAGAGCTGCATCGAATGGTTG 

AACAAACAAAAGGTAAACTCGGTGATATACATAAGCATGGGAAGCATA 

ATCAACGA6ATAATGGAAGTCGCGTCAGGATTGGCTGCTAGCAACCAACAOT 

GTGATCCGACCAGGGTCAATACCTGGTTCCGAGTGGATAGAGTCCATGCCTGAAGAGTTT 

AGTAAGATGGTTTTGGACCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTT 

tctcatcctgcagtaggagggttttggagccattgtggatggaactcgacactagaaagc 
atcggccaaggagttccaatgatctgcaggccattttcgggtgatcaaaaggtgaacgct 
agatacttggagtgtgtatggaaaattgggattcaagtggagggtgagctagacagagga 
gtggtcgagagagctgtgaagaggttaatggttgacgaagaaggagaggagatgaggaag 
agagctttcagtttaaaagagcaacttagagcctctgttaaaagtggaggctcttcacac 

AACTCGCTAGAAGAGTTTGTACACTTCATAAGGACTCTATGA 



>UGT76E2

ATGGAGGAAAAGCAAGTGAAGGAGACAAGGATAGTGTTGGTTCCAGTTCCAGCTCAAGGT 

CATGTAACTCCGATGATGCAACTAGGAAAAGCTCTTCACTCAAAGGGTTTCT^ 

GTTGTTCTGACACAGTCTAATCGAGTTAGCTCTTCCAAAGACTTCTCTGAOT 

CTCACCATCCCAGGCAGCTTAACTGAGTCTGATCTCCAAAACCTAGGACCACAAAAGTTT 

GTGCTCAAGCTCAATCT^TTTGTGAGGO^GCTTCAAGCAGTGTATAGGTC^ 

CATGAACAATGTAATAATGATATTGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCT 

catgctgcagtaaaagagtttcaacttcctagtgtcgtcttta(3cacgacaagtgctact 

gcttttgtctgtcgctctgttttgtctagagtcaacgcagagtcgttcttgatcgacatg 

aaagatcctgaaacacaagacaaagtatttccagggttgcatcctctgaggtacaaggat 

ctaccaacttcagtatttgggccaatagagagtacgctcaaggtttacagtgagactgtg 

aacactcgaacagcttccgctgttatcatcaactcagcaagctgtttagagagctcatct

ttggcaaggttgcaacaacaactgcaagttccggtgtatcctataggcccacttcatatt 

acagcttcagcgccttctagtttactagaagaagacaggagttgcgttgagtggttgaac 

aagcaaaaatcaaattcagttatttacataagcttgggaagcttggctct;^ 

aaagagatgttggagatggcttggggattaagtaatagcaaccaacctttcttatgggtg 

gtcagaccgggctctattccggggtcagaatggacagagtccttaccagaggaattcaat 

aggttggtttcagaaagaggttacattgtgaaatgggctccgcagatggaagttctcaga 

catcdtgcagtaggagggttttggagtcactgtggatggaactcaacagtagagagcatc 

ggggaaggagttccgatgatatgtaggcctttcaccggggatcagaaagtcaatgcgagg 

TACTTAGAGAGAGTTTGGAGAATTGGGGTTCAATTGGAGGGAGATCTGGATAAAGAAACT 
GTGGAGAGAGCTGTAGAGTGGTTGCTTGTGGATGAAGAAGGAGCAGAAATGAGGAAGAGA 
GCCATTGACTTGAAAGAAAAGATTGAAACCTCTGTTAGAAGTGGAGGTTCCTCATGCAGC 
TCACTAGACGACTTTGTTAATTCCATGTGA 






WO 2004/113540 PCT/GB2004/002432



>UGT76E3

ATGGAGAAAAGAGTAGAGAAGAGAAGGATAGTGTTGGTTCCA.CTTCCATTACTAGGAC^ 

TTCACTCCGATGATGCAACTCGGCCAAGCCCTTATCTTGAAGGGATTCTCAATTATAGTT 

CCTCAGGGAGAATTCAATCGAGTAAACTCTTCGCAGAAGTTCCCTGGTTTTCAATTTATC 

ACCATACCAGATTCTGAACTCGAGGCAAATGGACCAGTCGGGTCTCTAACACAGC 

AAAATTATGGAGGa\AGCTTCAAGGACTGTATAAGGCAGTTGTTGAAACAA(^ 

GATATTGCATGTATCATCTACGACGAGTTCATGTATTTTTGTGGAGCCGTAGCTGAGGAG 

TTGAAGCTTCCO^TTTCATCTTCAGTACTCAAACTGCTACACATAAAGTTTGCTGC^ 

GTTTTAAGCJ^CTTAATGCCAAGAAGTACTTGATCGACATGGAAGAGCATGACGTGG^ 

AACaAGGTAGTGGAAAATATGCATCCATTAAGATACAAAGACTTACCA^ 

6GAGAACTAGAACCTTTTTT6GAGCTCTGTAGAGATGTA6TCAAai?U^ 

GCTGTTATCATCAACACCGTGACCTGTCTAGAGAGCTCGTCTCTCACAAGGCTGCAACAA 

GAACTCCaJ^TTCCGGTGTATCCATTAGGCCCTCTTCACATTACAGATTCATCGACAGGA 

TTTACTGTGCTGCAAGAGGATAGGAGCTGCGTTGAATGGCTGAACAAGCAGAAACCAAGG 

TCTGTCATATACATAAGTTTAGGAAGCATGGTTCTCATGGAAACCAAGGAGATGTTAGAG 

ATGGCTTGGGGAATGTTGAATAGCAACCAACCTTTCTTATGGGTCATCCGACCTGGATCT 

GTCTCAGGGTCCGAGGGGATAGAGTCATTGCCAGAGGAAGTCAGTAAGATGGTTTTAGAG

AAAGGATACATTGTGAAATGGGCACCACAAATAGAAGTACTAGGACATCCCTCAGTGGGA 

GGCTTTTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGQATTGTGGAAG^ 

ATGATTTGCAGGCCTTATCAAGGCGAGCAGATGTTAAATGCAATATATCTAGAGAGTGTA

TGGAGAATAGGGATTCAGGTAGGAGGTGAACTGGAAAGAGGAGCCGTCGAGAGAGCTGTG 

AAGAGGTTGATTGTGGATAAAGAAGGTGCAAGCATGAGGGAGAGAACCCTTGTTTTAAAA 

GAGAAGCTCAAAGCCTCTATTAGAGGTGGAGGCTCCTCATGCAATGCATTAGATGAGCTT 

GTCAAGCACTTGAAGACAGAGTGA



>UGT76E4

ATGGAGAAAAGGGTAGAGAAGAGAAGGATTGTGTTAGTTCCGGTTGCTGCACAAGGACAT
GTAACCCCAATGATGCAGCTTGGGAAAGCCCTTCAATCAAAGGGCTTCTTAATTACTGTT 
GCTCAGAGACAGTTCAATCT^TAGGCTCATCATTGCa^ 

GTCACCATACCAGAAAGCTTACCTCAGTCTGAATCTAAGAAACTAGGACCAGCTGAGTAT 

CTTATGAATCTCAACAAAACAAGCGAGGCAAGCTTCAAGGAGTGTATAAGTCAGTTATCG 

ATGCAACAAGGCAATGATATAGCATGTATCATCTATGACAAGCTTATGTACTTCTGTGAA 

GCAGCAGCTAAGGAGTTTAAGATTCCTAGTGTTATCTTCAGCACTAGCAGTGCTACAATT 

CAAGTTTGCTACTGTGTTTTAAGTGAACTGAGTGCCGAGAAGTTCTTGATCGACATGAAA 

GATCCTGAAAAGCAAGATAAGGTGTTGGAAGGTTTGCATCCTTTAAGGTACAAAGACC 

CCAACTTCAGGATTTGGACCATTAGAGCCACTTTTGGAGATGTGTAGGGAAGTAGTTAAC 

AAAAGAACAGCTTCCGCTGTTATCATCAACACGGCGAGCTGTCTAGAGAGCTTGTCTCTG 

TCATGGCTGCAACAAGAACTTGGAATTCCAGTGTATCCATTAGGCCCTCTTCACATTACA

GCTTCATCGCCGGGACCTAGTTTACTGCAAGAGGACATGAGCTGCATTGAATGGCTGAAC

AAGCAGAAACCAAGGTCAGTCATATACATJ^GCTTGGGAACCAAAGCTCA^^ 

AAGGAGATGTTAGAGATGGCCTGGGGATTGTTGAATAGCAACCAACCTTTCTTATGGGTC 

ATCCGACCTGGCTCTGTTGCAGGCTTCGAGTGGATAGAGTTATTACCAGAGGAAGTCATT 

AAGATGGTAACAGAAAGAGGATACATAGCGAAATGGGCACCGCAGATAGAAGTACTTGGA 

CATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGTATT 

GTGGAAGGAGTCCCAATGATTTGCAGGCCTTTACAAGGCGAACAAAAGTTAAATGCGATG 

TATATAGAAAGTGTTTGGAAAATAGGGATTCAACTTGAAGGTGAAGTGGAAAGGGAAGGT 

GTAGAGAGAGCTGTGAAGAGGTTGATCATAGATGAAGAAGGTGCAGCCATGAGGGAGAGG 

GCTCTTGATTTAAAAGAGAAGCTCAATGCCTCGGTAAGAAGTGGAGGCTCCTCATACAAC 

GCACTGGATGAGCTTGTCAAGTTCTTGAATACAGAGTGA 



>UGT76E5 
ATGGAGAi\AAATGCAGAGAAGAAAAGAATAGTGTTGGTTCCATTTCCATTACAAGGACAT
ATCACTCCAATGATGCAACTTGGTCAAGCACTTAACCTGAAAGGCTTCTCGATTACCGTT 
GCTCTTGGAGATTCCAATCGAGTAAGTTCTACGCAACACTTCCCTGGTTTTCAATTTGTC 
ACAATACCTGAAACCATACCACTATCTCAACACGAGGCACTCGGAGTTGTCGAGTTTGTG 
GTTACGCTCAACAAAACAAGCGAGACAAGTTTCAAGGACTGTATAGCTCATTTGTTGCTG 
CAACATGGAAATGATATTGCTTGTATCATTTACGACGAGCTCATGTACTTCTCTGAAGCT 
ACAGCTAAGGATTTAAGGATTCCTAGTGTCATATTCACCACTGGTAGTGCTACAAATCAT 
GTTTGTTCTTGTATTTTAAGCAAACTCAACGCCGAGAAGTTCTTGATCGACATGAAAGAT
CCTGAAGTGCAAAACATGGTGGTGGAAAATTTACATCCACTAAAATACAAAGACTTACCA 
ACTTCAGGAATGGGGCCGCTAGAGCGATTTTTGGAGATTTGTGCCGAAGTTGTCAACAAA 
AGAACAGCTTCCGCTGTTATAATCAATACGTCAAGTTGTCTAGAGAGCTCGTCTCTGTCA 
TGGCTGAAACAAGAACTCAGTATTCCAGTGTATCCATTAGGCCCTCTTCACATTACAACT 
TCAGCAAATTTTAGTTTACTTGAAGAGGACAGGAGCTGCATTGAATGGCTGAACAAGCAG 
AAACTGAGGTCAGTTATATACATAAGCGTAGGAAGCATAGCTCACATGGAAACCAAGGAA 
GTATTGGAGATGGCTTGGGGATTGTATAATAGCAACCAACCTTTTCTATGGGTAATCCGA 
CCCGGTACAGAGTCAATGCCAGTGGAAGTCAGTAAGATTGTCTCGGAAAGAGGATGCATT 
GTGAAATGGGCGCCyiCAGAATGAAGTACTTGTGCATCCTGCAGTGGGAGGTTTCTGGAGC 
CACTGTGGATGGAACTCAACACTCGAGAGTATTGTGGAAGGAGTTCCAATGATTTGCAGA 
CCGTTTAACGGTGAGCAGAAGTTAAACGCGATGTATATAGAAAGTGTTTGGAGAGTAGGG 
GTTCTGCTTCAAGGAGAAGTGGAGAGAGGATGTGTAGAGAGAGCTGTGAAGAGGTTGATT 
GTGGATGATGAAGGTGTAGGAATGAGGGAGAGAGCCCTTGTTTTAAAAGAGAAGCTCAAT 
GCCTCTGTAAGAAGTGGAGGCTCTTCATACAATGCATTGGATGAGCTCGTCCATTACTTG 
GAGGCAGAGTATAGAAATACTTGA 



>UGT76E6

ATGGAGAAAATGGAAGAGAAGAAAAGGATAGTGTTAGTTCCGGTTCCAGCACAAAGACAT 

GTAACTCCAATGATGCAGCTTGGOVCAGCCCTAAACATGAAGGGCTTCTCTATTACTGTT 

GTTGAAGGACAGTTCAATAAAGTAAGCTCATCTCAAAACTTTCCTGGTTTTCAATTTGTA 

ACCATACCAGATACAGAGAGCTTGCCAGAGTCTGTGCTCGAGAGACTCGGACCGGTCGAG 

TTTTTATTCGAGATCAACWf^CCAiSTGAGGCAAGCTTCAAGGACTGTATAAGGCAGTCG 

TTGCTGCAACAAGGCAATGATATAGCATGTATCATCTACGACGAGTATATGTACTTCTGT 

GGAGCTGCAGCTAAGGAGTTCAACCTTCCTAGTGTAATATTCAGCACACAAAGTGCTACT 

AATCAAGTTTCCCGTTGCGTTTTAAGAAAACTCAGTGCCGAGAAGTTCTTGGTGGACATG 

GAAGGTATCCTGAAGTGCAGGAAACGTTGGTGGAAAATTTGCATCCATTAAGATACAAAG 

ACCTACCAACTTCAGGAGTTGGGCCACTAGATCGATTATTTGAGCTCTGTAGGGAAATAG 

TCAACAAAAGAACAGCTTCCGCTGTTATCATCAACACAGTGAGATGTCTAGAGAGCTCGT 

CTCTGAAACGTCTGCAACATGAACTCGGGATTCCGGTGTACGCATTAGGCCCTCTTCACA

TTACAGTTTCAGCAGCTTCTAGTTTACTGGAAGAGGACAGGAGCTGCGTTGAATGGTTGA 

ACAAGCAAAAACCGAGGTCAGTCGTTTACATAAGCTTGGGGAGCGTAGTTCAAATGGAAA 

CCAAAGAAGTGTTAGAGATGGCTCGGGGTTTATTTAATAGCAACCAGCCTTTCTTATGGG 

TCATTCGGCCTGGCTCTATCGCAGGCTCCGAATGGATAGAGTCACTGCCAGAGGAAGTCA 

TTAAGATGGTCTCCGAAAGAGGGTATATTGTGAAATGGGCACCACAGATAGAAGTACTTG 

GACATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACGCTTGAAAGCA 

TTGTGGAAGGAGTTCCAATGATATGCAGGCGCTTTCATGGCGAGCAAAAGTTAAACGCAC 

TGTGTTTAGAGAGTATTTGGAGAATAGGGTTTCAGGTGCAAGGTAAGGTAGAGAGGGGAG 

GGOTCGAGAGAGCTGTGAAGAGGTTGATAGTGGATGAAGAAGGTGCAGACATGAGAGAGA 

GAGCCCTTGTTTTAAAAGAGAATCTCAAAGCCTCTGTAAGAAATGGAGGCTCCTCATACA

ACGCATTGGAGGAGATCGTTAACCTCATGTAG 



>UGT76E7 

ATGGAGGAGAAGCTCTCGAGGAGAAGAAGAGTAGTGTTGGTTCCAGTTCCAGCTCAAGGA 
CATATAACTCCAATGATACAACTTGCAAAAGCACTTCACTCAAAAGGCTTCTCTATTACA 
GTTGTTCAAACO^GTTCyVACTACTTAAACCCTTCAAATGATTTGTCTGATTTTCAGTTT
GTAACCATCCCAGAGAACTTAQCAGTGTCTGATCTTAAGAATCTAGGACCAGGACGGTTT 
CTGATTAAGCTAGCTAATGAGTGTTATGTTAGCTTTAAGGATTTGTTAGGTCAGTTGTTG 
GTTAATGAAGAAGAAGAGATCGCTTGTGTTATCTACGACGAGTTCATGTACTTTGTTGAA 
GTAGCAGTTAAAGAGTTTAAGCTTCGTAATGTTATTTTAAGTACTACAAGTGCAACGGCT 
TTTGTTTGTCGCTTTGTTATGTGTGAACTCTATGCTAAAGATGGTTTGGCTCAACTTAAA 
GAAGGCGGTGAGCGAGAAGTGGAGTTAGTACCGGAGTTGTATCCTATACGGTACAAAGAT 
TTACCAAGTTCGGTATTTGCATCTGTAGAATCTTCAGTGGAGTTGTTTAAGAATACATGT 
TATAAAGGGACAGCTTCCTCTGTGATAATCAACACAGTGAGGTGTCTAGAGATGTCATCT 
TTGGAGTGGCTTCAACAAGAAqTTGAAATCCCGGTGTATTCTATAGGCCCGCTTCATATG 
GTGGTGTCAGCTCCTCCTACmGTCTTTTAGAAGAGAACGAGAGCTGTATAGAATGGTTG 
AACRAACAAAAGCCGAGCTCGGTGATATACATAAGCTTGGGAAGTTTTACTTTGATGGAA 
ACTAAAGAAATGTTGGAGATGGCTTATGGGTTTGTTAGTAGTAACCAACACTTCTTGTGG 
GTGA'ITCGACCGGGATCTATATGTGGTTCTGAAATCTCTGAGGAAGAGTTGTTGAAGAAG 
ATGGTAATTACGGATCGAGGTTACATTGTGAAATGGGCGCCGCAAAAACAAGTGCTTGCA 
CATTCTGCGGTTGGAGCGTTCTGGAGTCATTGTGGATGGAACTCGACTTTAGAAAGTCTT 
GGTGAAGGAGTTCCATTGATATGTAGGCCTTTTACTACTGATCAAAAGGGGAATGCAAGG 
TACTTGGAGTGTGTGTGGAAAGTAGGAATTCAAGTGGAGGGTGAGCTAGAGAGAGGCGCA 
ATCGAGAGAGCTGTGAAGAiSGTTAATGGTGGATGAAGAAGGAGAAGAGATGAAGAGAAGA 
GCTCTAAGTTTAAAAGAGAAACTCAAAGCCTCTGTTTTAGCTCAAGGTTCTTCACATAAA 
TCACTAGATGACTTCATCAAGACTCTGTGA 



>UGT76E9

ATGGAGGAAAAGCAAGAGAGGAGGAGAAGGATCGTGTTGATTCCCGCTCCAGCACAAGGA 
CACATATCTCCGATGATGCAACTTGCAAGAGCCCTTCACTTAAAGGGCTTCTCCATTACA 
GTTGCTCAAACCAAGTTCAATTACTTGAAGCCTTCAAAAGACTTAGCTGATTTTCAGTTT 
ATCACCATCCCAGAGAGCTTACCAGCCTCGGATCTTAAGAATCTAGGACa^GTTTGGTTT 
CTTCTTAAACTCAATAAAGAGTGTGAGTTTAGCTTCAAGGAGTGTTTAGGTCAATTGTTG 
CTGCAAAAACAACTTATACCGGAAGAAGAGATCGCTTQTGTCATCTACGACGAGTTCATG 
TACTTTGCTGAAGCTGCAGCCT^GAGTTTAACCTTCCCAAAGTTATTTTCAGTACCGAA 
AATGCGACGGCTTTTGCTTGTCGCTCTGCCATGTGCAAACTCTATGCAAAAGATGGTTTG 
GCTCCCCTTAAAGAAGGATGTGGGCGAGAAGAGGAGCTAGTGCCAAAGTTGCATCCCCTT 
AGATACAAAGACCTACCAACTTCAGCATTTGCACCAGTAGAAGCCTCAGTGGAAGTGTTT 
AAAAGTTCATGTGATAAAGGGACAGCTTCCGCTATGATAATCAACACAGTGAGGTGTCTA 
GAGATATCATCCTTGGAGTGGCTTCAACAAGAACTTAAGATTCCGATATATCCTATAGGC 
CCTCTTCACATGGTTTCTTCAGCTCCTCCTACGAGTCTACTAGACGAGAATGAGAGTTGC 
ATTGATTGGCTGAACAAACAAAAGCCGAGCTCGGTGATTTACATAAGTTTGGGAAGCTTT 
ACTTTGTTGGAAACTAAAGAAGTGTTGGAAATGGCTTCGGGCTTGGTTAGTAGTAACCAA 

cacttcttgtgggtgattcgacccgggtccatacttggttctgaattgactaatgaggaa 
ttattgagtatcatggaaataccggatcgaggctacattgtgaaatgggctccacaaaag 

CAAGTGCTTGCACATTCTGCGGTTGGAGCATTTTGGAGTCATTGTGGATGGAACTCGACT 

ctagagagcatgggtgaaggagttccgatgatttgtaggccttttactactgatcaaaag 
gtaaatgcgcggtatgtggagtgtgtctggagagttggggttcaagtggagggtgtiacta 

aagagaggagtagtcgagagagctgtgaagaggttactggtggatgaagaaggagaagag 

ATGAAGTTGAGAGCTCTCAGTTTGAAAGAGAAACTCAAAGTTTCTGTTCTACCGGGAGGT 

tcttcacacagttcactagatgacttaatcaagactctatga 



>UGT76F1

atggaagagagaaaagtgaagagaattatcatgttccctctaccgtttacaggacacttc 
aaccctatgatcgagcttgctggaatattccacaaccgtggcttctccgtcacgatactc 

CACACTTCTTTCAACTTCCCGGATCCTTCTCGCCATCCACAGTTTACTTTTCGAACTATC 
ACTCACAAAAACGAA.GGAGAAGAAGACCCTCTCTCTCAATCAGAAACTTCTTCGGGTAAG 
GACCTCGTCGTCCTTATTAGTCTGCTGAAACAATACTACACCGAGCCGTCTCTTGCAGAG

GAAGTAGGCGAAGGAGGGACGGTGTGTTGTTTGGTCTCCGACGCTCTATGGGGGAGGAAC 

ACGGAGATTGTAGCGAAAGAGATTGGAGTGTGTACAATGGTGATGAGGACTAGTGGTGCG 

GCAACGTTTTGTGCTTATACAGCTTTCCCTCTCCTTATAGATAAGGGTTACCTTCCTATA 

CAAGGTTCTAGATTAGATGAGCTAGTGACAGAGCTTCCACCTTTGAAAGTGAAGGATCTT

CCTGTAATAAAAACGAAAGAGCCTGAGGGACTAAACCGAATACTTAACGACATGGTGGAA 

GGAGCCAAGTTATCTTCCGGAGTCGTATGGAACACATTTGAAGATCTTGAAAGACATTCA 

CTCATGGATTGTCGCAGCAAGTTACAAGTTCCGTTGTTCCCAATCGGACCGTTTCAC^^ 

CATAGAACCGATCTTCCACCGAAGCCAAAGAACAAGGACAAGGACGATGATGAAATATTA 

ACCGATTGGCTTAACAAGCAAGCTCCGCAGTCTGTGGTCTATGTGAGTTTTGGAAGCCTT 

GCAGCTATAGAAGAGAATGAGTTTTTCGAAATTGCTTGGGGTCTAAGAAACAGCGAACTA

CCATTCTTGTGGGTGGTTAGGCCCGGGATGGTCCGGGGAACCGAGTGGCTTGAGTCATTG 

CCTTGTGGGTTTTTGGAAAATATTGGTCATCAGGGAAAAATTGTGAAATGGGTGAATCAA 

CTAGAGACATTGGCCCATCCTGCGGTTGGAGCGTTTTGGACGCACTGTGGATGGAACTCA

ACAATAGAGAGCT^TATGTGAAGGTGTTCCAATGATATGTACGCCGTGTTTCTCGGACCAG 

CATGTGAACGCGAGGTACATCGTTGATGTATGGCGAGTCGGGATGATGTTAGAGAGATGT 

AAGATGGAAAGGACGGAGATTGAGAAGGTAGTAACAAGTGTAATGATGGAGAATGGAGCT 

GGATTGACAGAGATGTGTTTGGAGTTGAAAGAGAAAGCTAATGTTTGCTTAAGTGAAGAT 

GGGTCTTCTTCCAAGTATCTAGAC^U^CTTGTCAGTCATGTCCTGTCTTTTGA^ 

GCTTTTGCAAGTTAA



>UGT76F2

ATGGAAGAGAGAAAAGGGAGGAGAATAATCATGTTCCCTCTTCCATTTCCAGGGCACTTC 

AACCCCATGATCGAGCTCGCTGGAATATTCCACCACCGTGGCTTCTCCGTGACGATCCTC 

CACACTTCCTACAACTTCCCCGATCCTTCTCGCCACCCACACTTCACTTTTCGAACCATC 

TCTCAC^^CAAAGAAGGAGAAGAAGATCCTCTGTCTCAGTCAGi^ 

CTAATCiSTTCTCGTTCGTCGGCTGAAACAACGCTACGCCGAACCGTTTCGTAAGTCTGTG 

GCGGCGGAAGTAGGTGGAGGAGAGACGGTGTGTTGTTTGGTCTCCGACGCTATATGGGGG 

AAGAACACGGAGGTTGTAGCGGAAGAGATTGGAGTTCGTAGGGTGGTGTTGAGGACAGGT 

GGTGCGTCGTCGTTTTGTGCTTTTGCCGCTTTCCCTCTCCTTAGGGATAAGGGTTACCTC 

CCTATACAAGATTCTAGATTAGATGAGCCAGTGACAGAGCTTCCACCTTTGA^ 

GATCTTCCGGTAATGGAAACGAATGAGCCGGAGGAACTTTACCGGGTAGTTAACGACATG 

GTGGAAGGAGCCAAGTCTTCTTCAGGAGTCATATGGAACACATTTGAAGATCTTGAAAGA 

CTAT(a.CTTATGAATTGTAGCAGCAAATTACAAGTTCCATTTTTCCCGA 

CACAAATATAGCGAAGATCCTACACCGAAGACAGAGAACAAGGAAGATACCGATTGGCTC 

GACAAGCAAGACCCACAGTCGGTGGTCTATGCGAGTTTCGGAAGCCTTGCAGCTATAGAA 

gagaaggagtttctcgagattgcttggggtctaagaaacagtgaacgaccgtttttgtgg 
gtggttaggccggggtctgtcagggggaccgagtggctcgagtcattgcctttagggttt 
atggaaaacattggagataagggaaaaatcgtgaaatgggcgaatcagttagaggtattg 
gcgcatcctgccattggagcgttttggacacattgtggatggaactcgacactagagagc 
atatgtgaAggtgttcctatgatatgtacgtcatgtttcacggaccagcatgtgaacgcg 
agatacatcgttgatgtatggcgagtcgggatgttgttagagagaagtaagatggaaaag 
aaggagattgaaaaggtgctaagaagtgtaatgatggagaagggagatggattgagggaa 
aggagtttgaagttgaaagagagagctgatttttgcttaagtaaagatgggtcttcttcc 

AAGTATTTAGAGAAACTTGTGAGTCATGTCCTGTCTTTTGATTCTTATGCTTTTGCAAGT 

TAA



>UGT78D1

ATGACCAAATTCTCCGAGCCAATCAGAGACTCCCACGTGGCAGTTCTCGCGTTTTTCCCC 
GTTGGCGCTCATGCCGGTCCTCTCTTAGCCGTCACTCGCCGTCTCGCCGCCGCTTCTCCC
TCCACCATCTTTTCTTTCTTCAACACCGCAAGATCAAACGCGTCGTTGTTCTCCTCTGAT 
CATCCCGAGAACATCAAGGTCCACGACGTCTCTGACGGTGTTCCGGAGGGAACCATGCTC 
GGGAATCCACTGGAGATGGTCGAGCTGTTTCTCGAAGCGGCTCCACGTATTTTCCGGAGC

GAAATCGCGGCGGCAGAGATAGAAGTTGGAAAGAAAGTGACATGCATGCTAACAGATGCC 

TTCTTCTGGTTCGCAGCGGACATAGCGGCTGAGCTGAACGCGACTTGGGTTGCCTTCTGG 

GCCGGCGGAGCAAACTCACTCTGTGCTCATCTCTACACTGATCTCATCAGAGAAACCATC 

GGTCTCAAAGATGTGAGTATGGAAGAGACATTAGGGTTTATACCAGGAATGGAGAATTAC 

AGAGTTAAAGATATACCAGAGGAAGTTGTATTTGAAGATTTGGACTCTGTTTTCCCAAAG 

GCTTTATACCAAATGAGT.CTTGCTTTACCTCGTGCCTCTGCTGTTTTCATCAGTTCCTTT 

GAAGAGTTAQAACCTACATTGAACTATAACCTAAGATCCAAACTTAAACGTTTCTTGAAC 

ATCGCCCCTCTCACGTTATTATCTTCTACATCGGAGAAAGAGATGCGTGATCCTCATGGC 

TGCTTTGCTTGGATGGGGAAGAGATCAGCTGCTTCTGTAGCGTACATTAGCTTCGGCACC 

GTtATGGAACCTCCTCCTGAAGAGCTTGTGGCGATAGCACAAGGGTTGGAATCTAGC^^ 

GTGCCGTTTGTTTGGTCGCTGAAGGAGAAGAACATGGTTCATCTACCAAAAGGGTTTTTG 

GATCGGAOVAGAGAGCAAGGGATAGTGGTTCCTTGGGCTCCACAAGTGGAACTGCTGAAA 

CACGAGGCS^TGGGTGTGAATGTGACy^CaLTTGTGGATGGAACTCAGTGTTGGAGAG^^ 

TCGGCAGGTGTACCGATGATCGGCAGACCGATTTTGGCGGATAATAGGCTCAACGGAAGA 

GCAGTGGAGGTTGTGTGGAAGGTTGGAGTGATGATGGATAATGGAGTCTTCACGAAAGAA 

GGATTTGAGAAGTGTTTGAATGATGTTTTTGTTCATGATGATGGTAAGACGATGAAGGCT 

AATGCCAAGAAGCTTAAAGAAAAACTCCAAGAAGATTTCTCCATGAAAGGAAGCTCTTTA 

GAGAATTTCAAAATATTGTTGGACGAAATTGTGAAAGTTTAG 



>UGT78D2

ATGACCAAACCCTCCGACCCAACCAGAGACTCCCACGTGGCAGTTCTCGCTTTTCCTTTC 

GGCACTCATGCAGCTCCTCTCCTCACCGTCACGCGCCGCCTCGCCTCCGCCTCTCCTTCC

ACCGTCTTCTCTTTCTTCSU^CACCGCACAATCCAACTCTTCGTTATTTTCCTCCGGTGAC 

GAAGCAGATCGTCCGGCGAACATCAGAGTATACGATATTGCCGACGGTGTTCCGGAGGGA

TACGTGTTTAGCGGGAGACCACAGGAGGCGATCGAGCTGTTTCTTCAAGCTGCGCCGGAG 

AATTTCCGGAGAGAAATCGCGAAGGCGGAGACGGAGGTTGGTACGGAAGTGAAATGTTTG 

ATGACTGATGCGTTCTTCTGGTTCGCGGCTGATATGGCGACGGAGATAAATGCGTCGTGG 

ATTGCGTTTTGGACCGCCGGAGCAAACTCACTCTCTGCTCATCTCTACACAGATCTCA^

AGAGAAACCATCGGTGTCAAAGAAGTAGGTGAGCGTATGGAGGAGACAATAGGGGTTATC 

TCAGGAATGGAGAAGATCAGAGTCAAAGATACACCAGAAGGAGTTGTGTTTGGGAATTTA

GACTCTGTTTTCTCAAAGATGCTTCATCAAATGGGTCTTGCTTTGCCTCGTGCCACTGCT 

GTTTTCATCAATTCTTTTGAAGATTTGGATCCTACATTGACGAATAACCTCAGATCGAGA 

TTTATVACGATATCTGAAOiTaSGTCCTCl'CGGGTTATTATCTTCTACATTGCAACAACTA 

GTGCAAGATCCTCACGGTTGTTTGGCTTGGATGGAGAAGAGATCTTCTGGTTCTGTGGCG 

TACATTAGCTTTGGTACGGTCATGACACCGCCTCCTGGAGAGCTTGCGGCGATAGCAGAA 

GGGTTGGAATCGAGTAAAGTGCCGTTTGTTTGGTCGCTTAAGGAGAAGAGCTTGGTTCAG

TTACCAAAAGGGTTTTTGGATA(3GACAAGAGAGCAAGGGATAGTGGTTCCATGGGCACCG 

CAAGTGGAACTGCTGAAACACGAAGCAACGGGTGTGTTTGTGACGCATTGTGGATGGA^^ 

TCGGTGTTGGAGAGTGTATCGGGTGGTGTACCGATGATTTGqVGGCCATTTTTTGGGGAT 

CAGAGATTGAACGGAAGAGCGGTGGAGGTTGTGTGGGAGATTGGAATGACGATTATCAAT 

GGAGTCTTCACGAAAGATGGGTTTGAGAAGTGTTTGGATAAAGTTTTAGTTCAAGATGAT 

GGTAAGAAGATGAAATGTAATGCTAAGAAACTTAAAGAACTAGCTTACGAAGCTGTCTCT 

TCTAAAGGAAGGTCCTCTGAGAATTTCAGAGGATTGTTGGATGCAGTTGTAAACATTATT 

TGA 



>UGT78D3 

ATGGCCAAACCCTCGCAGCCAACGCGAGACTCCCACGTGGCAGTTCTCGTTTTCCCCTTC 
GGCAGTCATGCAGCTCCTCTCCTCGCCGTCACGTGCCGTCTCGCCACCGCTGCTCCCTCC 
ACCGTCTTCTCCTTCTTCAGCACCGCACGATCCAACTCGTCGTTACTCTCCTCCGATATC
CCCACAAACATTCGTGTCCACAACGTCGATGACGGTGTTCCTGAGGGATTCGTGTTGACG 
GGGAATCCACAGCACGCTGTTGAGCTGTTTCTTGAAGCGGCGCCAGAGATTTTCCGAAGA
GAAATCAAGGCGGCCGAGACCGAAGTTGGTAGGAAGTTCAAGTGCATCCTTACGGATGCG

TTCCTCTGGTTAGCAGCGGAGACGGCGGCTGCGGAGATGAAAGCGTCGTGGGTTGCGTAC 

TATGGAGGCGGAGCAACCTCGCTCACTGCTCATCTCTACACAGATGCCATCAGAGAAAAC 

GTCGGTGTCAAAAGTAGGTGAGCGTATGGAGGAGACAATAGGGTTTATCTCAGGAATGGA 

GAAGATCAGAGTCAAAGACACACAAGAAGGCGTTGTGTTTGGGAACTTAGACTCTGTTTT 

CTCTAAAACGTTGCACCAAATGGGTCTTGCTTTACCTCGTGCCACTGCTGTTTTG^TGAA 

TTCCTTTGAAGAATTGGATCCTACGTTTACAAATGATTTCAGATCGGAATTCAAACGTTA 

CCTAAACATCGGTCCTCTCGCTTTATTATCTTCTCCy^TCGCAAACATCAACGCTAGTG^ 

CGATCCTCACGGTTGCTTGGCTTGGATCGAGAAGCGGTCCACTGCTTCTGTAGCGTACAT 

TGCCTTTGGTAGAGTCGCGACACCGCCTCCTGTAGAGCTTGTGGCGATAGCACAAGGATT 

GGAATCGAGTAAAGTGCCTTTTGTTTGGTCGCTACAAGAGATGAAAATGACTCATTTACC

AGAAGGCTTTTTGGATCGGACCAGAGAGCAAGGGATGGTGGTTCCATGGGCTlCCACyU^ 

GGAGCTGCTAAACCATGAAGCAATGGGTGTGTTTGTTTCGCATGGTGGGTGGAACTCAGT 

GTTGGAGAGTGTGTCGGCAGGTGTACCGATGATTTGTAGACCGATTTTCGGGGATCATGC

AATCAATGCAAGATCTGTGGAAGCTGTGTGGGAGATCGGAGTGACGATTAGTAGTGGAGT 

CTTCACGZLAGGATGGATTTGAGGAGAGTTTGGATCGGGTTTTGGTTCAAGATGATGGCAA 

GAAGATGAAGGTTAATGCTAAAAAGCTTGAAGAACTAGCa^CAAGAAGCTGTCTCTACCAA 

AGGAAGCTCCTTTGAGAATTTTGGAGGATTGTTGGACGAAGTTGTGAACTTTGGATAA



>UGT79B1

ATGGGTGTTTTTGGATCGAATGAATCGTCAAGCATGAGTATTGTGATGTATCCGTGGTTA 

GCCTTTGGTCACATGACTCGTTTTCTTCACCTATCCAACAAGCTCGCAGAGAAAGGTCAC 

AAGATTGTTTTCTTGCTTCCCAAGAAAGCACTAAACCAGCTTGAACCTCTTAATCTCTAC 

CCAAATCTCAtCACTTTCCACACCATCTCTATCCCTCAGGTCAAAGGGCTCC 

GCGGAGA(:i?U^CTCCGACGTCCCTTTCTTCTTGACACATTTGCTTGCAGTTGC^ 

CAAACCCGGCCAGAGGTCGAGACCATTTTCCGTACAATCAAACCGGACTTGGTTTTCTAT 

GATTCTGCCCATTGGATACCGGAAA.TTGCTAAACCGATCGGTGCTAAAACCGTTTGCTTC 

AA.CATCGTTAGCGCTGCGTCAATCGCACTGTCTCTTGTCCCTTCTGCGGAGAGAGAGGTC 

ATTGATGGCAAGGAAATGTCAGGGGAGGAGTTAGCTAAGACGCCTCTAGGTTACCCATCT 

TCGAAAGTAGTCTTACGTCCGCACGAAGCAAAATCCCTGAGTTTCGTGTGGAGGAAGCAC

GAGGCGATTGGCTCTTTCTTTGATGGGAAAGTTACCGCGATGAGAAACTGCGACGCAATC 

GCTATAAGGACTTGCCGTGAGACAGAAGGCAAATTCTGCGATTACATAAGTAGGCAGTAC 

AGTAAACCGGTTTACCTAACAGGACCGGTTCTCCCTGGATCCCAACCTAATCAGCCCTCC 

TTAGATCCTCAATGGGCGGAGTGGCTAGCCAAATTCAACCACGGTTCGGTTGTGTTCTGC 

GCTTTCGGTAGCCAACCCGTTGTAAACAAGATAGATCAGTTTCAAGAACTCTGTTTAGGT 

CTAGAATCAACTGGTTTTCCGTTTCTGGTTGCCATTAAGCCTCCTTCGGGTGTATCAACC 

GTCGAGGAAGCCTTACCGGAAGGATTCAAAGAGAGGGTTCAAGGACGTGGCGTTGTGTTT

GGAGGTTGGATTCAGCAACCGTTGGTGTTGAACCATCCTTCAGTGGGTTGTTTTGTTAGC 

CATTGCGGGTTTGGGTCGATGTGGGAGTCGTTGATGAGTGATTGTCAGATCGTTTTGGTT 

CCGCAGaiCGGAGAACAGATTTTGAAC6CAAGK3CTGA;T 

GTTGAAGTGGAGAGGGAAAAGAAAGGGTGGTTCTCGCGGCAAAGCTTGGAGAATGCTGTG 
AAGAGTGTGATGGAGGAAGGTAGTGAGATCGGTGAGAAAGTGAGGAAGAATCATGACAAG 
TGGAGATGTGTTTTGACTGACTCTGGTTTTTCAGATGGTTATATTGATAAGTTTGAACAA 
AATTTAATTGAACTTGTGAAGTCATGA



>UGT79B10 

ATGGGCCAAACGTTTCACGCCTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA 
TACTTGCATTTAGCCAACAAGTTAGCTGAGAGAGGTCACAGAATCACTTTCTTGATCCCC 
AAGAAAGCTCAGAAGCAGCTTGAACATCTCAATCTGTTTCCAGACAGCATCGTCTTTCAC 
TCTCTTACTATTCCTCATGTTGATGGTCTCCCCGCTGGAGCCGAGACTTTCTCGGATATC
CCTATGCCATTGTGGAAGTTCTTGCCCCCAGCTATAGATCTCACACGCGATCAAGTTGAA 
GCAGCGGTTAGTGCCTTGAGTCCGGACCTGATCTTGTTCGATATTGCTTCATGGGTTCCA 






GAAGTGGCTAAAGAGTATAGAGTCaAGAGTATGTTGTACy^CATCATATCAGCTACT^ 

ATAGCTCATGACTTTGTCCCAGGTGGTGAACTTGGAGTTCCTCCACCTGGTTATCCTTCC

TCAAAGTTGTTGTACCGCAAACACGATGCTCACGCCTTGTTGTCCTTCTCCGTCTACTAC 

AAGAGGTTTTCTCATCGGCTCATCACAGGTCTTATGAATTGTGATTTCATTTCGATAAGG 

ACATGCAAAGAAATCGAGGGTAAATTCTGCGAGTATCTTGAGCGTCAATACCATAAAAAG 

GTTTTCTTGACGGGTCCAATGCTTCCTGAGCCAAACAAAGGTAAACCACTGGAAGATCGA 

TGGAGTCATTGGCTGAACGGGTTTGAACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT 

CAAGTCACTCTAGAGAAGGACCAGTTCOAGAACTTTGTTTAGGAATAGAGCTTAC^^ 

TTACCGTTTTTTGTAGCTGTAACACCACCAAAAGGCGCAAAGACGATTCAAGATGCGTTA 

CCAGAAGGGTTCGAGGAGAGGGTGAAAGATCGTGGAGTGGTTTTGGGAGAATGGGTGCAA

CAACCGTTATTATTGGCTCaiTCCATG^GTAGGCTGCTTCTTGAGTCATTGCG 

TCAATGTGGGAATCTATAATGAGTGATTGCCA^U^TAGTTTTGCTTCCATTTT^ 

CAAGTTCTCAACACAAGATTGATGACCGAAGAACTCAAGGTTTCGGTTGAAGTGCAAAGA 

GAAGAAACAGGATGGTTCTCGAAGGAGAGCTTGAGTGTTGCTATCACATCTGTGATGGAC 

CAAGCTAGTGAGATCGGGAATCTGGTGAGAAGGAACCATTCCAAATTGAAGGAGGTTTTG 

GTTAGTGATGGATTATTAACCGGTTACACCGATAAATTTGTTGACACTTTGGAGAATCTT 

GTCAGCGAGACAAAGCGTGAATGA



>UGT79B11

ATGGGCCAAAAGATTCACGCTTTTATGTTCCCCTGGTTTGCTTTTGGTCATATGACTCCG 

TACTTGCATCTAGGCAACAAGTTAGCCGAGAAAGGTCATAGGGTTACTTTCTTGCTACCT 

AAGAAAGCTCAGAAACAATTGGAACATCAGAATCTATTTCCACACGGTATCGTCTTTCAT 

CCTCTTGTTATTCCTCATGTTGATGGCCTCCCTGCTGGTGCCGAGACAGCCTCGGATATC 

CCCATCTCGTTGGTGAAGTTCTTGTCTATAGCa^TGGATCTTACACGCGATCAGATCGAA 

GCCGCGATTGGTGCCTTGAGACCGGACCTAATCTTGTTCGATTTAGCTCACTGGGTTCCG 

GAAATGGCTAAAGCGCTTAAAGTCAAGAGTATGTTGTATAACGTGATGTCAGCTACCTCT 

ATAGCTCACGACCTTGTCCCAGGTGGTGAACTTGGAGTTGCTCCACCTGGTTATCCTTGA 

TCAAAGGCGTTGTACCGCGAACACGATGCTCACGCCTTGTTAACCTTCTCCGGCTTCTAC 

AAGAGGTTTTATCACCGGTTGACCACAGGTCTTATGAATTGCGATTTCATTTCGATTCGG

ACATGTGAAGAAATCGAAGGTAAATTTTGTGACTATATTGAGA|GTCAATACAAGAAGAAG 

GTTCTTTTAACCGGTCCAATGCTTCCCGAGCCTGACAAGAGTAAACCACTTGAAGATCAA 

TGGAGTCATTGGCTGAGTGGGTTTGGACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT 

CAAACCATTCTAGAGAAAAACCAATTCCAAGAACTCTGTTTAGGAATAGAGCTTACGGGT 

TTACCa^TTTCTTGTCGCGGTTAAGCCACa^AAAGGCGCAAACACA^ 

CCAGAAGGGTTCGAGGAAAGGGTGAAGGGTCGTGGAATAGTTTGGGGAGAATGGGTGCAG 

CAACCATCCTGGCAACCATTGATATTGGCTCATCCATCAGTAGGTTGCTTTGTGAGCCAT 

TGCGGATTCGGGTCAATGTGGGAATCTTTAATGAGTGATTGTCAl^TAGTCTTTATTCCA 

GTTTTGAATGATCAAGTTCTCACCACGAGAGTAATGACGGAGGAACTCGAGGTCTCCGTT 

GAGGTACAGAGAGAAGAAACAGGATGGTTCTCAAAAGAAAACTTGAGTGGTGCAATCATG 

TCTTTGATGGACCAAGACAGCGAGATAGGiGAACCAAGTGAGGAGGAACCATTCTTVAA^ 

AAGGAGACTTTGGCTAGTCCTGGATTATTAACCGGTTACACCGATAAATTTGTTGACACT 

TTGGAGAATCTAGTCAACGAACAAGGATACATATCTTGA



>UGT79B2

ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCCATATGACC 
CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTTTTGATT 
CCCJ^GAAAGCTCTGAAACAGTTGGAAAATCTCAATCTGTTTCCACACAACATTGTCTTT 
CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGACAGTCTCTGAG 
ATCCCCGTGACATCAGCTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATCAAGTT
GAAGGTGTGGTCCGAGCCGTGGAACCGGACCTGATCTTCTTTGACTTCGCTCATTGGATT 
CCAGAGGTAGCTAGAGACTTTGGCCTTAAGACTGTAAAGTACGTCGTGGTATCTGCATCG 
ACTATAGCTAGTATGCTTGTTCCAGGTGGTGAGTTAGGTGTTCCTCCGCCGGGATATCCT 
TCATCGAAGGTGCTGCTTCGTAAACAAGATGCTTACACCATGAAGAATCTGGAGTCTACA
AATACAATCAATGTCGGACCAAACTTATTGGAAAGAGTCACTACAAGTCTTATGAACTCT 
GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATCGAA 
AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT 
AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG 
TTTTGTGCGTTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA 
GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCACCTAGAGGCTCATCA 
ACGATTCAAGAAGCACTTCCTGAAGGATTCGAGGAGAGGGTTAAAGGAAGAGGAGTTGTT 
TGGGGAGAATGGGTTCAACAACCATTGCTATTGTCTCATCCATCAGTCGGGTGCTTTGTG 
AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTG 
GTACCACAGTTGGGTGATCAGGTCCTCAACACAAGATTGCTGAGTGACGAACTCAAGGTT 
TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTTCGATGCT 
ATCAATAGTGTGATGAAAAGGGACAGTGAGATCGGGAATCTGGTGAAGAAGAATCACACC 
AAGTGGAGGGAGACACTAACTAGTCCTGGACTTGTGACCGGTTATGTCGATAATTTCATA 
GAGTCATTGCAGGATCTTGTCTCTGGGACCAACCATGTTTCGAAGTAG



>UGT79B3

ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCC^ 

CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTCT^ 

CCCAAGAAATCTCTGAAACAGTTGGAAlCATTTCAATCTGTTTCCACACAACATTGTCTTT 

CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGAGAGCCTCTGAG 

ATCCCTGTGACATCAACTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATC^ 

GAAGCTGTGGTCCGAGCCGTTGAACCGGACCTGATCTTCTTTGACTTTGCTCATTGGATT 

CCAGAAGTAGCTAGGGACTTCGGCCTTAAGACTGTAAAGTACGTCGTGGTGTCTGCATCG

ACTATAGCTAGTATGCTTGTCCCAGGTGGTGAGTTAGGTGTTCCTCCACCGGGATATCCA 

TCATCAAAGGTGCTGCTTCGTi^CAAGATGCTTACACTATGAAGAAACTGGAGCCT^ 

AATACAATCGATGTCGGACCAT^CCTCTTGGAACGAGTCACTACAAGTCTT^^ 

GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATAGAA 

AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT 

AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG 

TTTTGTGCACTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA 

GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCCCCTAGAGGCTCATCA

ACGATTCAAGAAGCACTTCCTGAAGGATTCGAAGAGCGGGTTAAAGGAAGAGGCCTTGTT 

TGGGGAGGATGGGTTCAACAACCATTGATATTGTCTCATCCATCAGTCGGGTGCTTTGTG 

AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTA 

GTACCACAGTTGGGTGATCAAGTCCTGAACACAAGATTGCTGAGTGACGAACTCAAGGTT 

TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTGCGATGCT

GTCAATAGTGTGATGAAAAGGGACAGCGAGCTCGGGAACCTGGTGAGGAAGAATCACACC 

AAGTGGAGGGAGACAGTAGCTAGTCCTGGACTAATGACTGGTTATGTCGATGCTTTCGTA 

GAGTCATTGCAGGATCTTGTCTCTGGGACCACCCATGACTGA



>UGT79B4

ATGGGGTCAAAGTTTCATGCTTTTCTTTATCCATGGTTTGGTTTTGGTCATATGATTCCG 
TATCTTCATCTAGCTAACAAATTAGCTGAAAAAGGTCATAGGGTTACTTTCTTGGCTCCC 
AAGAAAGCTCAGAAACAACTCGAACCTCTCAACTTGTTCCCAAACAGCATTCACTTCGAG 
AATGTTACTCTTCCTCATGTTGATGGTCTCCCTGTTGGCGCAGAGACAACCGCGGATCTC 
CGGAACTCATCTAAGAGAGTCCTCGCTGATGCCATGGATCTTCTACGCGAACAGATTGAA 
GTTAAGATTCGTTCTTTGAAACCTGACCTA?^TTTTCTTCGATTTTGTTGATTGGATTCCA 
CAAATGGCAAAAGAATTAGGAATCAAAAGTGTAAGTTACCAGATCATATCGGCAGCTTTT 
ATAGCTATGTTTTTCGCTCCTCGTGCTGAATTAGGTTCTCCTCCACCTGGGTTTCCTTCA
TCAAAAGTAGCATTACGTGGACATGACGCTAACATCTATTCACTCTTCGCAAACACCCGC 
AAATTTCTCTTTGATCGAGTCACCACAGGCCTTAAGAACTGCGACGTCATTGCCATAAGG 
ACATGTGCAGAAATCGAAGGTAACTTATGTGATTTCATCGAAAGACAATGTCAGAGAAAA

GTTCTCTTAACCGGTCCAATGTTCCTTGATCCACAAGGGAAGAGTGGTAAGCCGCTAGAA 

GATCGATGGAATAATTGGTTAAACGGATTTGAACCAAGCTCGGTAGTGTACTGT6CGT 

GGCACCCATTTCTTTTTCGAGATAGATCAATTTCAAGAACTCTGTTTAGGAATGGAGCTC

ACGGGTCTACCTTTTTTGGTAGCGGTTATGCCACCGAGAGGGTCTTCAACGATTCAAGAA 

GCATTACCAGAAGGGTTCGAAGAACGGATTAAAGGGCGTGGAATTGTTTGGGGAGGATGG 

GTGGAACAACCTTTGATATTGTCTCATCCATCAATAGGTTGCTTTGTGAACCATTGCGGG 

TTCGGTTO^TGTGGGAGTCTTTGGTTAGTGATTGCCa^GATTGTGTTTATTCa^Cy^ 

GTTGATCAAGTTCTCACAACGAGATTGTTGACCGAAGAACTCGAGGTCTCCGTGAAAGTA 

AAGAGAGATGAAATTACTGGTTGGTTTTCGAAGGAGAGCTTGAGGGATACGGTCAAATCT 

GTGATGGATAAAAATAGTGAGATTGGGAATCTAGTGAGGAGGAATCATAAGAAACTGAAG 

GAAACTTTGGTTAGTCCTGGATTGTTGAGTAGTTATGCTGATAAGTTTGTTGACGAATTA 

GAGAATCATATCCACAGTAAGAATTGA 



>UGT79B5

ATGGGATCAAAATTTCATGCTTTTATGTATCCATGGTTTGGTTTTGGTCATATGATTCCA

TATCTTCATTTAGCCAACAAACTAGCTGAGAAAGGTCATAGGGTCACTTTCTTCCTCCCC 

AAG/^GCTCmTAAGCAGCTCCAACCTCTCAATCTGTTCCCAGACAGCATTGTCTTTGAG 

CCTCTTACTCTCCCTCCTGTCGATGGTCTCCCTTTTGGCGCCGAGACAGCCTCGGATCTC 

CCAAACTCAACTAAGAAACCCATATTCGTTGCCATGGATCTCTTACGCGATCAGATCGAA 

GCAAAGGTCCGTGCTTTGAAACCAGATCTAATCTTTTTCGATTTTGTTCATTGGGTTCCA 

GAAATGGCAGAAGAGTTTGGAATAAAGAGTGTCAATTACCAGATCATATCGGCAGCTTGT

GTAGCTATGGTTCTTGCACCTAGGGCTGAATTAGGGTTTCCTCCGCCGGATTATCCTTTA 

TCCAAAGTGGCGTTACGTGGACATGAAGCTAACGTCTGTTCTCTCTTTGCGAATTCCCAT 

GAGCTTTTCGGTCTGATGACCAAAGGCCTTAAGAACTGTGACGTCGTTTCCATAAG 

TGCGTGGAACTTGAAGGTAAGCTATGCGGTTTCATCGAAAAAGAATGTCAAAAGAAACTT 

CTCTTAACCGGTCCy^ATGCTCCCTGAACCGCAAAATAAGAGTGGTA^ 

CGATGGAATCACTGGTTAAACGGATTTGAACCAGGGTCGGTAGTGTTTTGTGCGTTTGGC 

ACTCAATTCTTTTTCGAGAAGGATCAATTTG?^GAATTCTGTTTAGGAATGGA.^ 

GGTCTACCGTTTTTAATATCGGTTATGCCGCCAAAAGGCTCACCAACGGTTCAAGAAGCG 

TTACCAAAAGGATTCGAAGAACGGGTTAAAAAGCATGGAATCGTTTGGGAAGGATGGTTG 

GAACAACCTTTGATATTGTCTCATCCATCAGTAGGTTGCTTTGTGAACCATTGTGGCTTT 

GGTTCAATGTGGGAGTCTTTGGTTAGTGATTGTCAGATTGTGTTTATTCCACAATTGGCA 

GATCa^GTTCTCyiTCACAAGATTGTTGACTGAAGAACTCGAAGTCTCTGTGA^ 

AGAGAAGATTCCGGATGGTTCTCGAAAGAGGACTTGAGAGATACTGTTAAATCTGTGATG 

GATATAGATAGTGAGATTGGGAACTTAGTGAAGAGGAATCATAAGAAATTGAAAGAGACT 

TTAGTTAGTCCTGGATTGTTAAGTGGTTATGCTGATAAGTTTGTAGAAGCATTGGAGATT 

GAAGTO^CAACACCAAATTTT 



>UGT79B6

ATGGGGTCAAAGTTTCATGGTTTTATGTTCCCATGGTTTGGTTTTGGTCACATGACTGCA 
TTTTTGCATCTGGCTAACAAACTAGCGGAGAAAGACCACAAAATAACTTTCTTGCTCCCC 
AAGAAAGCTCGAAAGCAACTTGAATCTCTCAATCTCTTCCCAGACTGCATTGTCTTTCAG 
ACTCTTACCATCCCATCTGTAGATGGCCTCCCTGATGGTGCTGAGACAACCTCGGATATC 
CCGATCTCGTTAGGCAGTTTTCTCGCCTCGGCTATGGATCGGACACGCATTCAGGTCAAA 
GAAGCAGTTTCTGTTGGTAAACCGGATCTGATTTTCTTCGATTTTGCTCACTGGATTCCG 
GAAATAGCTAGAGAGTATGGAGTCAAGAGTGTCAATTTCATAACGATTTCTGCAGCATGT 
GTAGCTATTTCGTTCGTCCCTGGTCGTAGTCAAGATGACTTGGGTAGTACTCCACCGGGA 
TACCCTTCCTCCAAGGTGTTGCTTCGGGGACACGAAACCAACAGTTTGTCGTTCCTCTCC 

tatccgtttggagatggaactagtttttacgaacggatcatgataggacttaagaactgc 

gatgtcatttcgataaggacatgccaagaaatggaaggaaagttctgcgatttcatcgaa 
aaccaatttcaaagaaaagttctcttgacaggtccaatgcttcctgagccggacaatagc 
AAACCGCTAGAAGATCAATGGCGTCAGTGGCTTAGCAAGTTCGATCCGGGATCAGTAATA
TATTGTGCATTGGGCAGCCAAATCATTCTTGAAAAGGATC?y^TTCCAAGAACTCTGTT^^ 
GGAATGGAGCTGACAGGTTTACCATTTCTTGTAGCGGTAAAGCCACCAAAAGGTTCATCG 
ACAATCCAAGAAGCCTTACCAAAAGGGTTTGAAGAGAGGGTTAAAGCACGTGGAGTGGTT 
TGGGGAGGATGGGTGCAGCAACCATTGATATTAGCTCATCCATCAATAGGCTGCTTTGTG
AGCCATTGTGGTTTCGGGTCAATGTGGGAGGCTCTAGTGAATGACTGCCAAATAGTGTTT 
ATTCCACATTTGGGTGAGCAAATATTGAACACAAGACTGATGAGCGAGGAACTCAAGGTC 
TCGGTAGAGGTGAAAAGAGAGGAAACGGGATGGTTTTCGAAGGAGAGCTTGAGCGGTGCG
GTCAGGTCTGTGATGGACAGAGATAGCGAGCTCGGGAATTGGGCGAGGAGGAACCACGTA 
AAGTGGAAGGAGTCTCTGCTTCGTCATGGACTAATGAGTGGTTATCTTAATAAGTTCGTA 
GAAGCATTGGAGAAACTAGTCCAAAATATAAATCTTGAATGA 



>UGT79B7

ATGGAGCCAAAGTTTCATGCTTTTATGTTTCCATGGTTTGCTTTTGGTCATATGATTCCA 
TTTCTACATCTTGCAAACAAACTAGCTGAAAAAGGTCACCGAGTTACTTTCTTGCTACCT 
AAGAAAGCACaLAAAACAGTTGGAACATCACAACTTGT 

CCTCTCACAGTTCCTCCTGTCAATGGCCTCCCTGCTGGTGCCGAGACAACCTCGGATATC 
CCCATCTCGTTGGACTy^CCTCTTGTCO^ 

GCTGCGGTTCGTGCTTTGAGACCTGACTTGATCTTTTTCGATTTTGCTCAATGGATTCCA 

GATATGGCTAAAGAACATATGATCAAGAGTGTGAGTTACATCATTGTATCTGCGACAACA 

ATAGCTCATACACATGTCCCTGGAGGTAAATTAGGTGTTCGCCa^CCGGGTT^^ 

TCAAAGGTGATGTTCCGTGAAAACGATGTTCATGCCTTAGCAACCTTATCGATATTTTAC 

AAGAGACTGTATCATCAGATCACTACAGGTCTTAAGAGCTGTGATGTCATTGCATTGAGG 

ACTTGCJ^GAAGTCiSAAGGTATGTTCTGCGACTTTATATCGCGTCAATACCATAAGAAG 

GTTCTCTTGACTGGTCCAATGTTCCCTGAGCCAGACACAAGTAAACCACTAGAAGAACG 

TGGAATCATTTTCTAAGCGGGTTCGCGCCGAAGTCAGTAGTGTTTTGTTCACCTGGCAGC

caagtaattcttgagaaagatcaattccaagaactctgtttagggatggagct;^ 

TTACCATTTCTTTTAGCGGTAAAGCCACCAAGAGGATCATCAACGGTCCAAGAAGGGTTA 

ccagaagggttcgaggagcgggtgaaagatcgtggtgttgtttggggaggatgggtgcaa 

CAACCTTTGATATTGGCTCATCCATCAATAGGTTGCTTTGTGAACCATTGTGGTCCCGGA 

acaatatgggagtctttggtgagtgattgccaaatggttttgattccatttttaagtgat 

caagttctcttcacaagattgatgaccgaggaattcgaggtctctgtagaagtgccgagg 

gaaaaaacaggatggttttcaaaggagagcttgagcaatgctatcaaatctgtgatggat 

aaagact^gtgacattgggaagttagtgaggagtaaccacaiccaaattgaagg^^ 

gttagtcctggattattgactggttacgttgatcactttgtagagggattgcaagagaat 

TTGATTTGA



>UGT79B8

ATGGAGCC?^CGTTCCATGCTTTTATGTTTCCCTGGTTTGCTTTTG 

tttctacatcttgcaaacat^ctagctgagaaaggtcatca;^ 

AAGAAAGCCCAAAAACAGTTGGAACATCACAATCTGTTCCCAGACAGTATTGTCTT^ 

CCTCTCACAATCCCTCATGTCAATGGCCTCCCTGCTGGTGCTGAGACAACCTCGGATATC 

TCAATCTCGATGGACAACTTACTGTCGGAAGCCTTGGATCTCACTCGCGATCAGGTTGAA 

gctgcggttcgtgctctgagaccggacttgatcttttttgattttgctcattggattcca 

gaaattgccaaagagcatatgatcaagagtgtgagttacatgatagtatctgcaacaaca 

atagcttatacatttgcccctggtggtgtattaggtgttcccccaccaggttatccttca 

tcaaaggtgttgtaccgtgaaaacgatgctcatgccttagcaaccttatctatcttctac 

aagagactttatcatcagatcactacaggttttaagagctgtgacatcattgcattgagg 

acatgtaatgaaatcgaaggtaaattctgcgactatatatcaagtcaata 

gttctcttgactggtccaatgctccctgagcaagacacaagtaaaccactagaagaacag 

TTGAGTCATTTTCTGAGCAGGTTCCCACCGAGGTCAGTGGTGTTTTGTGCACTTGGTAGC 

cagatcgttcttgaaaaggatcaattccaagaactctgcttagggatggagctgacaggt 
TTACCGTTTCTTATAGCGGTAAAGCCACCGAGAGGATCATCGACGGTCGAAGAAGGGTTA
CCAGAAGGGTTCCAGGAGCGGGTGAAAGGGCGTGGTGTGGTTTGGGGAGGATGGGTGCAA 
CAACCATTGATATTGGATCATCCGTCAATAGGCTGCTTTGTGAACCATTGTGGTCCGGGA 
ACAATATGGGAGTGTCTTATGACTGATTGTCAAATGGTTTTGCTTCCATTTTTAGGTGAT 
CAAGTTCTCTTCACAAGATTGATGACCGAGGAATTCAAGGTGTCTGTAGAAGTGTCGAGA 
GAAAAAACAGGATGGTTTTCAAAGGAGAGCTTGAGCGATGCGATCAAGTCTGTGATGGAT 
AAAGATAGCGACCTCGGAAAGCTAGTGAGGAGTAACCACGCCAAATTGAAGGAGACTCTT 
GGTAGTCATGGATTATTAACTGGTTACGTGGATAAATTTGTAGAGGAATTGCAAGAGTAT
TTGATTTGA 



>UGT79B9

ATGGGCCAAAATTTTCACGCTTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA 

TACTTGCATCTAGCCAACAAGCTAGCTGCTAAAGGTCATAGGGTTACTTTCTTGCTGCCT 

AAGAAAGCTCaAAAACAGTTGGAACATCACAATCTGTTTCCAGACAGGATCATCTTT^ 

TCTCTTACTATTCCCCATGTTGATGGCCTACCTGCTGGCGCGGAGACCGCCTCGGACATC 

CCCATCTCGTTGGGGAAGTTTCTTACCGCAGCCATGGATCTCACTCGCGATCAGGTCGAA

GCCGCGGTTCGTGCTTTGAGACCy^GACCTGATCTTTTTCGATACTGCTTATTGGGTTCCG 

GAAATGGCGAAAGAACACAGAGTCAAGAGTGTGATATACTTTGTGATATCAGCTAACTCC 

ATAGCTCATGAACTTGTACCAGGTGGTGAATTAGGAGTTCCTCCACCTGGCTATCCTTCG 

TCAAAAGTGTTGTACCGTGGACACGATGCTCACGCTTTGTTGACTTTTTCCATCTTCTAC 

GAGAGGCTTCATTACCGGATAACAAOVGGTCTAAAGAATTGTGATTTTATCTCyy^TT^ 

ACTTGTAAAGAAATCGAAGGTAAATTCTGCGACTATATAGAGCGTCAATACCAGAGGAAG 

GTTCTTTTGACAGGTCCAATGCTTCCAGAGCCAGATAACAGTAGACCACTCGAAGATCGA 

TGGAATCACTGGCTGAATCAGTTCAAACCCGGCTCGGTAATATATTGTGCATTGGGAAGT 

CAAATCACTCTAGAGAAGGATCAATTCCAAGAACTCTGTTTAGGAATGGAGCTCACTGGT 

TTACCGTTTCTCGTAGCGGTAAAACCACCAAAAGGCGCAAAGACGATCCAAGi^ 

CCAGAAGGGTTTGAGGAGAGGGTGAAGAATOITGGAGTAGTTTGGGGAGA^ 

CAACCATTGATATTGGCTCATCCATCAGTAGGCTGCTTTGTGACCCATTGTGGGTTTGGA

TCAATGTGGGAGTCTCTAGTGAGTGATTGTCAAA.TAGTCTTGCTTC(31TATT'TGTGTGAT 

CAAATTCTCAACACTAGATTGATGAGTGAGGAACTCGAGGTTTCGGTGGAAGTGAAAAGA 

gaagaaacaggatggttctcgaaagagagcttaagtgttgcgatcacctcggtgatggac 
aaagatagtgagttagggaatctggtgaggaggaaccacgctaaattaaaggaggttttg 

GTTAGTCCTGGATTATTAACCGGTTACACCGATGAATTTGTTG2^CTTTGCAGAATATA 
GTCAACGATACAAATCTTGAATGA



>UGT82A1

ATGAAAGTAACACAAAAGCCAAAGATAATATTCATCCCTTATCCGGCGCAAGGCCACGTC 

ACTCCGATGCTTCACCTTGCATCGGCTTTCCTCAGCCGTGGATTCTCCCCTGTCGTTATG 

ACTCCCGAGTCTATCCACCGTAGGATCTCGGCTACTAACGAGGATCTTGGGATC^^ 

TTGGCCTTATCTGACGGTCAAGATCiSTCCGGACGCACCTCCCTCGGACTTCTTCTCGATA 

GAGAACTCAATGGAGAACATCATGCCACCACAGCTCGAACGGCTCCTACTAGAAGAAGAC

TTGGATGTGGCTTGTGTTGTGGTTGATTTGCTGGCTTCGTGGGCTATAGGAGTGGCTGAT 

CiSGTGTGGAGTTCCGGTCGCCGGATTCTGGCCGGTGATGTTCGCTGCTTACCGTTTGATC 

CAAGCAATACCGGAGCTAGTCCGAACAGGCTTAGTTTCCCAAAAAGGTTGTCCTCGTCAA 

CTAGAAAAAACAATAGTCCAGCCAGAGCAACCGCTCCTATCCGCAGAAGATCTACCGTGG 

CTGATCGGAACTCCCAAAGCTCAGAAAAAACGATTCAAGTTCTGGCAAAGAACTCTAGAA 

CGAACAAAAAGTCTCCGTTGGATCTTGACAAGCTCCTTTAAAGATGAATATGAAGATGTC 

GACAACCACAAAGCATCCTACi^AAAAATCTAACGATTTAAACAAAGAAAA^ 

AACCCTCAAATCCTTCATTTAGGTCCATTGCATAACCAAGAAGCAACAAATAATATAACT 

ATAACCAAGACTAGTTTTTGGGAAGAAGACATGTCTTGTCTAGGTTGGCTTO^GAAC^ 

AACCCGAACTCAGTCATTTATATCTCATTTGGAAGTTGGGTTTCTCCTATAGGAGAATCA

AATATTCAAACGTTGGCATTGGCGTTGGAAGCGTCAGGGAGACCTTTCCTTTGGGCGTTA 
AACCGAGTGTGGCAAGAGGGACTACCACCAGGTTTTGTGCATAGAGTCACAATTACCAAA
AACCAAGGAAGGATCGTCTCATGGGCTCCGCAACTTGAAGTTCTTAGAAACGATTCTGTG 
GGATGTTACGTGACTCATTGTGGCTGGAACTCGACTATGGAGGCAGTGGCAAGTTCCCGG 
AGGCTACTATGTTATCCGGTGGCCGGAGACCAGTTTGTTAACTGTAAATACATCGTGGAC 
GTTTGGAAGATTGGAGTGAGATTGAGCGGGTTTGGAGAGAAGGAGGTTGAAGATGGACTA 
AGGAAAGTAATGGAGGATCAAGATATGGGTGAGAGATTGAGGAAGTTAAGAGACAGAGCA 
ATGGGGAATGAAGCTCGTTTGAGTTCGGAAATGAATTTTACATTTTTAAAAAACGAGCTT 

AATTAG



>UGT83A1 

ATGGATAATAACTCy^TAAAAGAATGGGAAGGCCACATGTTGTGGTCATACCTTACCCT 

GCACAAGGTCATGTTCTTCCTCTAATAAGTTTCTCACGTTACCTTGCGAAAp^GGAATC 

CAAATTACATTCATAAACACCGAGTTTAACCATAACCGCATCATCAGTTCCTTACCCAAT 

TCACCTCATGAAGATTATGTTGGGGATCAGATCAATCTTGTTTCAATCCCTGACGGTTTA 

GAAGATTCACCAGAAGAGAGGAACATTCCAGGGAAGTTGTCGGAGTCTGTTTTGCGTTTT 

ATGCCTAAAAAAGTAGAGGAATTGATCGAGAGGATGATGGCAGAJ^ACTAGCGGTGGTACG 

ATCATTAGCTGCGTTGTAGCGGATCAGAGCTTGGGATGGGCAATTGAAGTTGCAGCTAAG 

TTTGGGATCAGACGCACCGCGTTTTGTCCTGCTGCAGCTGCGTCTATGGTTCTTGGATTT 

AGTATTCAAAAACTTATCGATGATGGTCTCATAGATTCTGATGGGACTGTGAGAGTAAAT 

AAGACAATTCAACTATCTCCCGGGATGCCAAAGATGGAAACAGACyy^GTTTGTGTGG^^ 

TGTCTGAAGAACAAAGAATCTCAGAAAAACATATTCCAACTTATGCTTCAAAACAATA^^ 

TCGATCGAGTCAACGGATTGGTTGTTGTGTAACTCTGTCCATGAACTTGAAACTGCAGCA 

TTTGGATTGGGCCCGAATATAGTACCAATTGGGCCCATTGGTTGGGCTCATAGTCTTGAA 

GAGGGATCCACGTCa^CTAGGAAGCTTTTTACCTCATGACCGGGATTGTCTAGATTGGTTG 

GACCGGCAGATTCCCGGTTCGGTTATATATGTTGCCTTTGGGAGTTTTGGGGTCATGGGC 

AACCCTCAGTTAGAAGAGCTAGCAATTGGTCTAGAGCTTACCAAGAGGCCAGTTTTGTGG 

GTCACTGGTGATCAACAACCAATCAAACTTGGGTCGGATCGAGTCAAAGTGGTGAGATGG 

GCTCCACAACGGGAGGTCCTTTCTTCTGGAGCCATTGGGTGTTTTGTGAGCCATTGTGGA 

TGGAATTCAACTCTGGAAGGAGCCCAAAATGGCATACCATTTCTATGCATCCCTTATTTT 

GCAGACCAATTTATCAACAAAGCATATATATGCGATGTGTGGAAGATTGGATTAGGACTT

GAAAGAGACGCACGAGGAGTGGTTCCGAGGTTAGAGGTTAAGAAGAAGATCGATGAGATC 

ATGAGAGACGGTGGAGAGTATGAAGAACGAGCTATGAAGGTTAAAGAGATTGTGATGAAA

AGTGTTGCyy^GATGGAATATCTTGTGAGAATCTTAATAAATTTGTCAACTGGATCAAA 

TCACAAGTGAATTGA



>UGT84A1

ATGGTGTTCGT^CTTGTCCATCTCCAAACCCAATTCATGTAATGCTCGTCTCGTTTCAA 
GGACAAGGCCACGTCAACCCTCTTCTTCGTCTCGGCAAGTTAATTGCTTCAAAGGGTTTA 
CTCGTTACCTTCGTTACAACGGAGCTTTGGGGCAAGAAAATGAGACAAGCCAACAAAATC 
GTTGACGGTGAACTTAAACCGGTTGGTTCCGGTTCAATCCGGTTTGAGTTCTTTGATGAA 
GAATGGGCAGAGGATGATGACCGGAGAGCTGATTTCTCTTTGTACATTGCTCACCTAGAG 
AGCGTTGGGATACGAGAAGTGTCTAAGCTTGTGAGAAGATACGAGGAAGCGAACGAGCCT 
GTCTCGTGTCTTATCAATAACCCGTTTATCCCATGGGTCTGCCACGTGGCGGAAGAGTTC 
AACATTCCTTGTGCGGTTCTCTGGGTTCAGTCTTGTGCTTGTTTCTCTGCTTATTACCAT
TACCAAGATGGCTCTGTTTCATTCCCTACGGAAACAGAGCCTGAGCTCGATGTGAAGCTT 
CCTTGTGTTCCTGTCTTGAAGAACGACGAGATTCCTAGCTTTCTCCATCCTTCTTCTAGG 
TTCACGGGTTTTCGACAAGCGATTCTTGiSGCAATTCAAGAATCTGAGCAAGTCCTTCTGT 
GTTCTAATCGATTCTTTTGACTCATTGGAACAAGAAGTTATCGATTACATGTCAAGTCTT 
TGTCCGGTTAAAACCGTTGGACCGCTTTTCAAAGTTGCTAGGACAGTTACTTCTGACGTA 
AGCGGTGACATTTGCy^TCAACAGATAAATGCCTCGAGTGGTTAGACTCGAGGCCTAAA 
TCGTCAGTTGTCTACATTTCGTTCGGGACAGTTGCATATTTGAAGCAAGAACAGATCGAA 
GAGATCGCTCACGGAGTTTTGAAGTCGGGTTTATCGTTCTTGTGGGTGATTAGACCTCCA 
CCACACGATCTGAAGGTCGAG^

AAAGGTAAAGGGATGATTGTGGATTGGTGCCCACAAGAGCAAGTCTTGTCTCATCCTTCA 
GTGGCATGCTTCGTGACTCATTGTGGATGGAACTCGACAATGGAATCTTTGTCTTCAGGT 
GTTCCGGTGGTTTGTTGTCCGCAATGGGGAGATCAAGTGACTGATGCAGTGTATTTGATC
GATGTTTTCAAGACCGGGGTTAGACTAGGCCGTGGAGCGACCGAGGAGAGGGTAGTGCCA 
AGGGAGGAAGTGGCGGAGAAGCTTTTGGAAGCGACAGTTGGGGAGAAGGCAGAGGAGTTG 
AGAAAGAACGCTTTGAAATGGAAGGCGGAGGCGGAAGCAGCGGTGGCTCCAGGAGGTTCG 
TCGGATAAGAATTTTAGGGAGTTTGTGGAGAAGTTAGGTGCGGGAGTAACGAAGACTAAA
GATAATGGATACTAG 



>UGT84A2 

ATGGAGCTAGAATCTTCTCCTCCTCTACCTCCTCATGTGATGCTCGTATCTTTTCCAGGG 

a^GGCCACGTTAATCCACTTCTTCGTCTTGGTAAGCTCTTAGCTTC^^ 

ATAACCTTCGTCACCACTGAGTCATGGGGCAAAAAGATGCGAATCTCCAACAAAATCC^ 

GACCGTGTCCTCAAACCGGTTGGTAAAGGCTATCTCCGGTATGATTTCTTCGACGACGGG 

CTTCCTGAAGACGACGAAGCTAGCAGAACCAACTTAACCATCCTCCGACCACATCTAGAG 

CTGGTCGGCAAAAGAGAGATCT^GAACCTTGTGAAACGTTACAAGGi^GTAACGAT^ 

CCCGTGACATGTCTTATCAACAACCCTTTCGTCTCTTGGGTCTGTGACGTGGCAGAAGAT 

CTTCAAATCCCTTGTGCTGTTCTTTGGGTTCAATCTTGTGCCTGCTTAGCTGCTTATTAC 

TATTACCACCACAACCTAGTTGACTTCCCGACCAAAACAGAACCCGAGATCGATGTCCi^ 

ATCTCTGGCATGCCTCTCTTGAAACATGACGAGATCCCTTCTTTCATTCyiCCCTTCAAGT 

CCTCACTCCGCTTTGCGAGAAGTGATCATAGATCAGATTAAACGGCTTCACAAGACTTTC 

TCCATTTTCATCGACACTTTCyy^CTCATTGGAGAAAGACATGATTGACCACA 

CTCTCTCTCCCCGGTGTTATCAGACCGCTAGGACCACTCTACAAAATGGCTAAAACCGTA

GCTTATGATGTCGTTAAAGTAAACATCTCTGAGCCAACGGATCCTTGCATGGAGTGGTTA 

GACTCGCAGCCAGTTTCCTCCGTTGTTTACATCTCATTCGGGACCGTTGCTTACTTGAAA 

a^G-?^CAAATAGACGAGATCGCTTACGGTGTGTTAAACGCCGACGTT^^ 

GTGATTAGACAACAAGAGTTAGGTTTCAAC?^G 

AAAGGGAAAGGGAAGATCGTTGAATGGTGTTCACAAGAGTyyiGTATTATCTCATCCTTCA 
GTGGCATGTTTCGTGACTCACTGTGGATGGAACTCAACGATGGAAGCTGTGTCTTCCGGA 
GTCCCGACGGTTTGTTTTCCTCAATGGGGAGATCAAGTCACGGACGCCGTTTACATGATC 
GATGTTTGGAAGACGGGAGTGAGGCTAAGCCGTGGAGAGGCGGAGGAGAGGTTAGTGCCG 
AGGGAGGAAGTTGCGGAGAGGTTGAGAGAGGTTACTAAAGGAGAGAAAGCGATCGAGTTG 
AAAAAGAATGCTTTGAAGTGGAAGGAAGAGGCGGAGGCGGCGGTTGCTCGCGGTGGTTCG 
TCGGATAGGAATCTTGAAAAGTTTGTGGAGAAGTTGGGTGCCAAACCTGTGGGGAAAGTA 
CAAAACGGGAGTCATAATCATGTCTTGGCTGGATCAATCAAAAGCTTTTAA 



>UGT84A3

ATGGACCCGTCTCGTCATACTCATGTGATGCTCGTATCTTTCCCCGGCCAAGGTCACGTA 
AACCCTCTACTTCGTCTCGGAAAGCTCATAGCCTCTAAAGGCTTACTCGTCACCTTTGTC 
ACCACAGAGAAGCCATGGGGCAAGAAGATGCGTCAAGCCAAC^ 

CTCAAACCGGTCGGTCTAGGTTTCATCCGGTTTGAGTTCTTCTCTGACGGCTTCGCCGAC

GACGATGAAAAAAGATTCGACTTCGATGCCTTCCGACCACACCTTGAAGCTGTCGGAAAA 

CAAGAGATCAAGAATCTCGTTAAGAGATATAACAAGGAGCCGGTGACGTGTCTCATAAAC 

AACGCTTTTGTCCCATGGGTATGTGATGTCGCCGAGGAGCTTCACATCCCTTCGGCTGTT 

CTATGGGTCCAGTCTTGTGCTTGTCTCACGGCTTATTACTATTACCACCACCGGTTAGTT 

AAGTTCCCGACCAAAACCGAGCCGGACATCAGCGTTGAAATCCCTTGCTTGCCATTGTTA 

AAGCATGACGAGATCCCAAGCTTTCTTCACCCTTCGTCTCCGTATACAGCTTTTGGAGAT 

ATCATTTTAGACCAGTTAAAGAGATTCGAAAACCACAAGTCTTTCTATCTTTTCATCGAC 

ACTTTTCGCGAACTAGAAAAAGACATCATGGACa^CATGTCyi.CAACTTTGTCCTC^ 

ATCATCAGTCCTGTCGGTCCGCTCTTCAAGATGGCTCAAACCTTGAGTTCTGACGTTAAG 

GGAGATATATCCGAGCCAGCGAGTGACTGCATGGAATGGCTTGACTCAAGAGAACCATCC 
TOVGTCGTTTACATCTCCTTTGGGACTATAGCCAACTTGAAGCAAGAGCAGATGGAGGAG
ATCGCTCATGGCGTTTTGAGCTCTGGCTTGTCGGTCTTATGGGTGGTTCGGCCTCCCATG 
GAAGGGACATTTGTAGAACCACATGTTTTGCCTCGAGAGCTCGAAGAAAAGGGTAAAATC 
GTGGAATGGTGTCCCCAAGAGAGAGTCTTGGCTCATCCTGCGATTGCTTGTTTCTTAAGT 
CACTGCGGATGGAACTCGACAATGGAGGCTTTAACTGCCGGAGTCCCCGTTGTTTGTTTT 
CCGCAATGGGGAGATCAAGTGACTGATGCGGTGTACTTGGCTGATGTTTTCAAGACAGGA 
GTGAGACTAGGCCGCGGAGCCGCTGAGGAGATGATTGTTTCGAGGGAGGTTGTAGCAGAG 
AAGCTGCTTGAGGCCACAGTTGGGGAAAAGGCGGTGGAGCTGAGAGAAAACGCTCGGAGG 
TGGAAGGCGGAGGCCGAGGCCGCCGTGGCGGACGGTGGATCATCTGATATGAACTTTAAA 
GAGTTTGTGGACAAGTTGGTTACGAAACATGTGACGAGAGAAGACAACGGAGAACACTAG 



>UGT84A4 

ATGGAGATGGAATCGTCGTTACCTCATGTGATGCTCGTATCATTCCCAGGGCAAGGTCAC 

ATi^GCCCTCTTCTTCGTCTCGGAAAGATCATTGCCTCTAAAGGCTTAATCGTCACCTTT 

GTAACCACAGAGGAACCATTGGGCAAGAAGATGCGTCAAGCCAACAATATTCAAGACGGT 

GTGCTCAAACCGGTCGGGCTAGGTTTTCTCCGGTTCGAGTTCTTCGAGGATGGATTTGTC 

TACAA^GAAGACTTTGATTTGTTACAAAAATCACTTGTAGTTTCCGGAAAACGAGAGATC 

AAGAATCTTGTCAAGAAATATGAGAAGCAACCAGTGAGATGTCTCATAAATAATGCCTTT 

GTTCCATGGGTTTGTGACATAGCCGAGGAGCTTCAAATCCCATCAGCTGTTCTTTGGGTC 

CAGTCTTGTGCTTGCCTCGCCGCTTATTACTATTACCACCACCAGTTAGTTAAGTTTCCG 

ACCGAAACCGAGCCXSGAAATAACCGTTGACGTCCCTTTCAAGCCATTAACATTGAAGCAT 

GACGAGATCCCTAGCTTTCTTCACCCTTCCTCTCCGCTGTCCTCTATAGGAGGTACCATT

TTAGAGCAGATCAAGCGACTTCACAAGCCTTTCTCTGTTCTCATCGAAACTTTTCAAGAA 

CTTGAAAAAGATACCATTGACCACATGTCCCAGCTCTiSCCCTCAAGTCAACTTa^ 

ATCGGTCCGCTTTTTACTATGGCTAAAACCATAAGGTCTGACATCAAGGGAGACATCTCC 

AAGCCAGATAGTGACTGCATAGAGTGGCTTGACTCGAGAGAACCATCCTCCGTTGTTTAC 

ATCTCTTTTGGQACTTTGGCTTTCTTGAAGCTAAACCT^GATCQACGAGATTGCTCACGGC 

ATTCTCAACTCCGGGTTGTCCTGCTTATGGGTTTTGCGGCCTCCCTTAGAAGGCTTAGCC 

ATAGAACCGCATGTCTTGCCTCTAGAGCTTGAAGAGAAAGGGAAGATTGTGGAATGGTGT 

CAACAAGAGAAAGTTTTGGCTCATCCTGCGGTTGCTTGCTTCTTAAGTCACTGTGGATGG 

AACTCAACCATGGAGGCTTTAACTTCAGGAGTTCCCGTTATTTGTTTCCCGCAGTGGGGA 

gatcaggtgacaaatgcggtgtacatgattgatgttttcaagacaggattgagactcagc 
cgtggagcttccgatgagaggattgttccaagggaggaggttgctgagcgactgcttgag
gccaccgttggagagaaggcggtggagctgagagaaaacgctcggaggtggaaggaggag 
gcggagtctgccgtggcttacggtggaacatcggaaaggaattttcaagagtttgttgac 
aagttggttgatgtcaagacaatgacaaacattaataatgtcgtgtaa 



>UGT84B1

ATGGGCAGTAGTGAGGGTCAAGAAACACATGTCCTAATGGTAACACTACCATTCCAAGGT
CACT^TCS^TCCT^TGCTO^CTCGCy^AAAaiTCT^^ 

ATCAATCTCGCCACTATTGAGTCAGCCCGTGATCTCCTCTCCACCGTAGAAAAACCTCGT 
TATCCGGTGGACCTCGTGTTCTTCTCCGATGGTCTACCTAAAGAAGATCCAAAGGCCCCT 
GAAACTCTTTTGAAGTCATTGAATAAAGTCGGAGCCATGAACTTGTCTAAAATCATCGAA 
GAAAAGAGATACTCTTGTATCATCTCTTCGCCTTTTACTCCATGGGTTCCAGCTGTTGCA 
GCCTCTCATAACATCTCTTGTGCAATACTTTGGATCCAAGCTTGTGGAGCTTACTCGGTT 
TATTACCGTTACTACATGAAGACAAACTCTTTCCCTGATCTTGAAGATCTGAATCAAACG 
GTGGAGTTACCAGCTTTACCATTGTTGGAAGTTCGAGATCTTCCATCGTTTATGTTACCT 
TCTGGTGGTGCTCACTTCTATAATCTAATGGCGGAATTTGCAGATTGTTTGAGGTATGTG 
AAATGGGTTTTGGTTAATTCATTCTATGAACTCGAATCAGAGATAATCGTy^TCGAtGGCT 
GATTTAAAACCTGTAATTCCAATTGGTCCTCTGGTTTCTCCATTTCTGTTGGGCGATGGT 
GAGGAGGAAACCCTAGACGGTAAAAACCTAGATTTTTGTAAATCTGATGATTGTTGTATG 
GAGTGGCTTGACAAGCAAGCTAGGTCTTCTGTTGTGTACATATCTTTCGGAAGTATGCTC






GAAACATTGGAGAATCAGGTCGAGACCATAGCGAAGGCGCTGAAGAACAGAGGACTTCCA 
TTTCTTTGGGTGATAAGGCCAAAGGAGAAAGCCCAAAACGTTGCTGTTTTGCAGGAGATG 
GTGAAAGAAGGACAAGGGGTTGTTCTCGAGTGGAGTCCACAAGAGAAGATTTTGAGCCAC 
GAGGCAATCTCTTGTTTTGTCACGCATTGCGGCTGGAACTCGACTATGGAGACGGTGGTG
GCTGGTGTTCCTGTGGTAGCGTACCCTAGCTGGACGGATCAGCCCATTGACGCGCGGTTG 
CTTGTTGATGTGTTTGGAATCGGAGTAAGGATGAGGAATGACAGTGTCGATGGCGAGCTT 
AAGGTCGAAGAAGTAGAAAGATGCATTGAGGCCGTGACGGAGGGACCCGCTGCCGTGGAT 
ATAAGAAGGAGAGCGGCGGAGCTAAAGCGCGTGGCGAGATTGGCGTTGGCACCTGGTGGA
TCTTCGACACGGAATTTAGACTTGTTCATTAGTGATATCACAATCGCCTAA 



>UGT84B2

atgggaagtaatgagggtc^

catctcaatccaatgctcaaattcgca^^catctcgcacgaaccyy^tctacacttc^ 
ctcgccaccactgagcaagcccgtgacctcctctcttccaccgctgacgaacctcataga 

CCGGTGGACCTCGCTTTCTTCTCAGACGGTCTACCTAAAGACGATCCAAGAGATCCCGAC 

actctcgcaaagtcattgaaaaaagatggagccaagaacttgtcaaaaatcatcgaagaa 

AAGAGATTTGATTGCATGATCTCTGTGCCTTTTACTCCCTGGGTTCCAGCTGTTGCAGCT

gcacataacattccttgtgcaatcctctggatccaagcttgtggagctttttctgtttat 

taccgttattacatgaagacaaatcctttccccgaccttgaagatctgaatcaaacag 

gagttaccagctttaccattgttggaagtccgagatctgccgtcattgatgttaccttct 

caaggagctaatgtcaataccctaatggcggaatttgcagattgtttgaaagatgt^ 

tgggttttggttaactcgttttacgaactcgaatcagagatcatcgagtctatgtctgat 

ttaa?^cctataatcccaattggtcctcttgtttctccattcctgttgggaaatgatgaa 

gaaaaaaccctagatatgtggaaagttgatgattattgtatggagtggcttgacaagcaa 

GCTAGGTCTTCAGTTGTTTACATATCTTTCGGAAGCATACTCAAATCATTGGAGAATCAA 

gttgagaccatagcaacggcattaaaaaacagaggagttccatttctttgggtgatacgg 
ccgaaggagaaaggcgaaaacgtccaggttttgcaggagatggttaaagaaggtaaaggg
gttgtaactgaatggggtcaacaagaaaagatattgagccacatggcgatttcttgcttc 
atcacgcattgtggatggaactcgacgatcgagacggtggtgactggtgttcccgtggtg 
gcgtatccgacttggatagatcagccgcttgatgcgagactgcttgtggatgtgtttgga 
atcggagtaaggatgaagaacgacgctatcgatggagagcttaaggttgcagaggtggag 
agatgcattgaggccgtgacagagggacctgccgccgcggatatgaggaggagagcgacg 
gagctgaagcacgccgcaagatcggcgatgtcacctggtggatcttccgctcagaattta 
gactcgttcattagtgatatcccaatcacttga



>UGT85A1

atgggatctcagatcattcataactcacaaaaaccacatgtagtttgtgttccatatccg 
gctcaaggccacatcaaccctatgatgagagtggctaaactcctccacgccagaggcttc
tacgta^ccttcgtcaacaccgtctacaaccacaatcgtttccttcgttctcgtg^ 

AATGCCCTAGATGGACTTCCTTCGTTCCGATTTGAGTCCATTGCTGACGGTCTACCAGAG
ACAGACATGGATGCCACGCAGGACATCACAGCTCTTTGCGAGTCCACCATGAAGAACTGT
CTCGCTCCGTTCAGAGAGCTTCTCCAGCGGATCAACGCTGGAGATAATGTTCCTCCGGTA 
AGCTGTATTGTATCTGACGGTTGTATGAGCTTTACTCTTGATGTTGCGGAGGAGCTTGGA 
GTCCCGGAGGTTCTTTTTTGGACAACCAGTGGCTGTGCGTTCCTGGCTTATCTACACTTT 
TATCTCTTCATCGAGAAGGGCTTATGTCCGCTAAAAGATGAGAGTTACTTGACGAAGGAG 
TACTTAGAAGACACGGTTATAGATTTTATACCAACCATGAAGAATGTGAAACTAAAGGAT 
ATTCCTAGCTTCATACGTACCACTAATCCTGATGATGTTATGATTAGTTTCGCCCTCCGC 
GAGACCGAGCGAGCCAAACGTGCTTCTGCTATCATTCTAAACACATTTGATGACCTTGAG 
CATGATGTTGTTCATGCTATGCAATCTATCTTACCTCCGGTTTATTCAGTTGGACCGCTT 
CATCTCTTAGCAAACCGGGAGATTGAAGAAGGTAGTGAGATTGGAATGATGAGTTCGAAT
TTATGGAAAGAGGAGATGGAGTGTTTGGATTGGCTTGATACTAAGACTCAAAATAGTGTC
ATTTATATCAACTTTGGGAGGATAACGGTTTTGAGTGTGAAGCAGCTTGTGGAGTTTGCT 
TGGGGTTTGGCGGGAAGTGGGAAAGAGTTTTTATGGGTGATCCGGCCAGATTTAGTAGCG
GGAGAGGAGGCTATGGTTCCGCCGGACTTTTTAATGGAGACTAAAGACCGCAGTATGCTA 
GCGAGTTGGTGTCCTCAAGAGAAAGTACTTTCTCATCCTGCTATTGGAGGGTTTTTGACG 
CATTGCGGGTGGAACTCGATATTGGA7UVGTCTTTCGTGTGGAGTTCCGATGGTGTGTTGG 
CCATTTTTTGCTGACCAGCAAATGAATTGTAAGTTTTGTTGTGACGAGTGGGATGTTGGG 
ATTGAGATAGGTGGAGATGTGAAGAGAGAGGAAGTTGAGGCGGTGGTTAGAGAGCTCATG 
GATGGAGAGAAGGGAAAGAAAATGAGAGAAAAGGCGGTAGAGTGGCAGCGCTTAGCCGAG 

aaagcgacggaacataaacttggttcttccgttatgaattttgagacggttgttagcaag

TTTCTTTTGGGACAAAAATCACAGGATTAA



>UGT85A2 

ATGGGATCTCATGTCGCACAAAAACAACACGTAGTTTGCGTTCCTTATCCGGCTCAAGGC 

CACATCAACCCAATGATGAAAGTGGCTAAACTCCTTTACGCCay\AGGC 

TTCGTCAACACCGTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGGCCTAACGCCGTT 

GACGGGCTTCCTTCTTTCCGGTTTGAGTCCATCCCTGACGGTCTACCCGAGACTGACGTG

GACGTCACTCAGGACATCCCTACTCTTTGCGAGTCCACAATGAAGCACTGTCTCGCTCCA 

TTCAAGGAGCTTCTCCGGCAGATCAACGCAAGGGATGATGTTCCTCCTGTGAGCTGTATC 

GTATCCGACGGTTGTATGAGCTTCACACTTGATGCTGCGGAGGAGCTCGGTGTCCCGGAG

GTTCTTTTTTGGACAACTAGTGCTTGTGGCTTCTTGGCTTACCTTTACTACTATCGCTTC 

ATCGAGAAGGGATTATCACCAATAAAAGATGAGAGTTACTTAACCAAGGAACACTTGGAC 

ACAAAAATAGACTGGATACCATCGATGAAGAACCTAAGACTAAAAGAOITCCCTAGCT^ 

ATCCGAACGACTAATCCTGACGACATCATGCTCAACTTTATCATCCGTGAGGCTGAC 

GCCAAACGCGCTTCAGCTATCATTCTCAACACGTTTGATGATCTCGAACACGACGTTATC 

CAATCTATGAAATCCATTGTACCTCCGGTTTATTCTATTGGACCGTTACATTTACTAGAG

AAACAAGAGAGCGGCGAGTATAGTGAAATCGGACGGACAGGATCGAATCTTTGGAGAGAG 

GAGACTGAGTGTCTGGACTGGCTAAACACGAAAGCTAGAAACAGTGTTGTGTACGTTAAC

TTCGGGAGTATAACTGTTTTGAGCGO^AAACAGCTTGTGGAGTTTGCATGGGGTTTTO^ 

GCAACGGGGAAAGAGTTTTTGTGGGTGATCCGGCCGGATTTAGTAGCCGGGGATGAGGCA 

ATGGTTCCACCGGAGTTTTTAACGGCTACGGCGGACCGGAGGATGTTGGCAAGTTGGTGT 

CCTGftAGAGAAAGTCCTTTCTCATCCGGCCATTGGAGGGTTCTTGACGCATTGCGGGTGG 

AACTCGACGTTGGAAAGTCTATGCGGTGGAGTTCCAATGGTGTGTTGGCCGTTTTTTGCA 

GAGCAACAAACTAATTGTAAGTTTTCTCGTGACGAATGGGAGGTTGGGATTGAGATTGGT 

GGAGATGTGAAGAGAGAAGAGGTTGAGGCGGTGGTTAGGGAGTTGATGGATGAAGAGAAG 

GGAAAGAATATGAGAGAGAAGGCGGAAGAGTGGCGGCGCTTGGCGAATGAAGCGACGGAG 

CATAAGCATGGTTCTTCTAAATTGAACTTTGAGATGCTCGTTAATAAGGTTCTTTTAGGG

GAGTAG



>UGT85A3

ATGGGATCCCGTTTTGTTTCTAACGAACAAAT^CCACACGTAGTTTGCGTGCCTTACCCA 

GCTCAAGGCCACATTAACCCTATGATGAAAGTGGCTAAACTCCTCCACGTCAAAGG^ 

a^CGTCACCTTCGTCAACACCGTCTACAACCACAACCGTCTACTCCGATCCCGTGGGGCC 

AACGCACTCGATGGACTTCCTTCCTTCCAGTTCGAGTCAATACCTGACGGTCTTCCGGAG 

ACTGGCGTGGACGCCACGCAGGACATCCCTGCCCTTTCCGAGTCCACAACGAAAAACTGT 

CTCGTTCCGTTCAAGAAGCTTCTCCAGCGGATTGTCACGAGAGAGGATGTCCCTCCGGTG

AGCTGTATTGTATCAGATGGTTCGATGAGCTTTACTCTTGACGTAGCGGAAGAGCTTGGT 

GTTCCGGAGATTCATTTTTGGACCACTAGTGCTTGTGGCTTCATGGCTTATCTACACTTT 

TATCTCTTCATCGAGAAGGGTTTATGTCCAGTAAAAGATGCGAGTTGCTTGACGAAGGAA 

TACTTGGACACAGTTATAGATTGGATACCGTCAATGAACAATGTAAAACTAAAAGACATT 

CCTAGTTTTATACGTACCACTAATCCTAACGACATAATGCTCAACTTCGTTGTCCGTGAG 

GCATGTCGAACCAAACGTGCCTCTGCTATCATTCTGAACACGTTTGATGACCTTGAACAT 

GACATAATCCAGTCTATGCAATCCATTTTACCACCGGTTTATCCAATCGGACCGCTTCAT

CTCTTAGTAAACAGGGAGATTGAAGAAGATAGTGAGATTGGAAGGATGGGATCAAATCTA 
TGGAAAGAGGAGACTGAGTGCTTGGGATGGCTTAATACTAAGTCTCGAAATAGCGTTGTT
TATGTTAACTTTGGGAGCATAACAATAATGACCACGGCACAGCTTTTGGAGTTTGCTTGG 
GGTTTGGCGGCAACGGGAAAGGAGTTTCTATGGGTGATGCGGCCGGATTCAGTAGCCGGA 
GAGGAGGCAGTGATTCCAAAAGAGTTTTTAGCGGAGACAGCTGATCGAAGAATGCTGACA 
AGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCGGCGGTCGGAGGGTTCTTGACCCAT 
TGCGGGTGGAATTCGACGTTAGAAAGTCTTTCATGCGGAGTTCCAATGGTATGTTGGCCA 
■p.p.j,r.jTQCXGAGa^CAAACAAATTGTAAGTTTTCTTGTGATGAATGGGAGGTTGGTATT 
GAGATCGGTGGAGATGTCAAGAGGGGAGAGGTTGAGGCGGTGGTTAGAGAGCTCATGGAT
GGAGAGAAAGGAAAGAAAATGAGAGAGAAGGCTGTAGAGTGGCGGCGCTTGGCCGAGAAA 
GCTACAAAGCTTCCGTGTGGTTCGTCGGTGATAAATTTTGAGACGATTGTCAACAAGGTT 
CTCTTGGGAAAGATCCCTAACACGTAA 



>UGT85A4

ATGGAACAACATGGCGGTTCTAGCTCACAGAAACCTCACGCAATGTGCATACCTTATCCA 

GCACAAGGCCACATCAACCCAATGCTGAAACTAGCCAAGCTCCTCCACGCTAGAGGCTTC 
CACGTCACTTTCGTCAACACCGACTACAACCACCGCCGTATCCTCCAATCACGTGGCCCT 
CACGCTCTCAACGGTCTCCCCTCGTTTCGCTTCGAGACTATCCCCGACGGTCTTCCTTGG 
ACAGACGTCGACGCTAAGCAAGACATGCTCAAGCTTATTGACTCCACAATAAACAACTGT 
TTAGCTCCATTCAAAGACCTCATCCTCCGGTTAAACTCCGGTTCTGATATACCACCGGTT 
AGCTGTATCATCTCCGACGCTTCAATGAGCTTCACy^TTGACGCAGCGGAGGAGCTTAAA 
ATTCCGGTAGTTCTCCTCTGGACCa^CAGTGCTACTGCTTTAATCTTGTATCTCCATTAC 
CAAAAACTCATCGAGAAAGAGATAATTCCCCTCAAAGATTCGAGTGACTTGAAGAAGCAT 
TTAGAGACGGAGATTGATTGGATACCGTCGATGAAGAAGATTAAGCTTAAGGATTTTCCA 
GATTTCGTCACCACGACGAATCCTCAAGATCCGATGATTAGTTTCATCCTTCATGTj^CC 
GGAAGAATCAAAAGAGCTTCTGCGATCTTCATCAACACTTTCGAAAAACTCGAGCATAAC
GTTCTCTTATCTCTGCGATCTCTTCTCCCTCAGATCTACTCCGTTGGACCGTTCCAGATT 
CTGGAGAATCGCGAAATCGATAAGAACAGCGAAATCAGAAAGCTAGGATTGAATCTCTGG 
GAAGAAGAGACGGAGTCTTTGGATTGGCTAGATACTAAAGCTGAGAAAGCTGTGATTTAC 
GTCAACTTCGGGAGTCTAACGGTTTTGACTAGTGAGCAGATCTTAGAGTTCGCTTGGGGT 
TTAGCGAGGAGCGGGAAAGAGTTTCTCTGGGTGGTGAGATCTGGTATGGTCGACGGAGAT 
GATTCGATTCTTCCGGCGGAGTTTTTATCGGAGACGAAGAATCGAGGAATGTTAATTAAA 
GGATGGTGTTCTCAGGAGAAGGTACTTTCGCATCCGGCGATTGGAGGATTTTTGACTCAC 
TGTGGATGGAATTCGACGTTGGAGAGTTTGTACGCCGGTGTTCCGATGATCTGTTGGCCA 
TTTTTTGCTGATCAGTTGACGAATCGAAAGTTCTGTTGCGAGGATTGGGGGATTGGGATG 
GAGATCGGCGAGGAGGTGAAGAGGGAGAGAGTGGAGACGGTGGTTAAAGAGCTCATGGAC 
GGAGAGAAGGGAAAGAGGTTAAGAGAGAAGGTGGTGGAGTGGCGGCGCTTGGCGGAAGAA 
GCTTCGGCGCO^CCGTTGGGATCaLTCGTACGTGAATTTTGAAACGGTGGTTAATAAAGTC 
CTTACATGTCACACGATTAGATCGACCTAA



>UGT85A5

ATGGCGTCTCATGCTGTTACAAGCGGACAAAAACCACACGTAGTTTGCATACCTTTCCCG 
GCTCAAGGCCACATCAATCCGATGCTCAAAGTGGCTAAACTCCTCTATGCCAGAGGCTTC 
CATGTTACCTTCGTCAACACTAACTACAACCATAACCGTCTCATCCGGTCACGTGGTCCC 
AACTCCCTTGATGGGCTTCCTTCTTTTCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG 
GAAAACAAGGACGTCATGCAGGATGTCCCTACCCTTTGTGAGTCCACCATGAAAAACTGT 
CTAGCTCCTTTCAAGGAGCTTCTCCGGCGGATCAACACCACAAAGGATGTTCCTCCGGTA 
AGCTGTATTGTATCCGACGGTGTGATGAGCTTTACTCTTGATGCTGCAGAGGAGCTTGGA 
GTCCCGGATGTTCTTTTTTGGACACCAAGTGCTTGTGGCTTCTTGGCTTATCTACACTTC 
TATCGCTTCATCGAGAAGGGGTTATCACCAATAAAAGATGAAAGTTCTTTGGACACAAAA 
ATAAATTGGATACCATCGATGAAAAACCTAGGACTTAAAGACATCCCAAGCTTTATCCGT 
GCAACTAATACTGAAGACATAATGCTTAACTTTTTTGTCCATGAGGCTGACCGAGCCAAA 
CGCGCTTCCGCTATCATTCTCAACACATTCGATAGTCTTGAGCATGATGTCGTCCGTTCT 
ATTCAATCTATCATACCTCAAGTGTACACTATTGGACCGCTTCATCTATTTGTGAATCGG
GATATCGACGAGGAAAGTGACATCGGACAGATAGGAACGAATATGTGGAGAGAGGAGATG 
GAGTGTTTGGATTGGCTTGATACTAAGTCTCCAAACAGTGTCGTTTATGTTAATTTCGGT 
AGCATAACAGTGATGAGTGCGAAACAACTCGTGGAGTTTGCTTGGGGTTTAGCAGCGACC 
AAAAAAGATTTTTTGTGGGTGATTAGGCCGGATTTAGTAGCCGGTGATGTGCCAATGCTT 
CCGCCGGACTTTCTAATAGAGACGGCTAACCGAAGGATGCTAGCGAGTTGGTGTCCTCAA 
GAAAAAGTTCTTTCTCATCCGGCAGTTGGAGGGTTCTTAACGCATAGTGGATGGAATTCG 
ACTTTGGAGAGTCTCTCCGGTGGAGTTCCAATGGTGTGTTGGCCGTTCTTTGCGGAACAG 
CaJ^CAAATTGTAAATATTGTTGTGATGAATGGGAAGTGGGGATGGAGATCGGTGGAGAT 
GTGAGGAGGGAGGAGGTTGAGGAGTTGGTTAGAGAACTCATGGACGGAGACAAAGGAAAG
AAAATGAGGCAAAAGGCCGAAGAGTGGCAGCGCTTGGCTGAGGAAGCGACGAAGCCTATT 
TATGGTTCGTCGGAACTAAATTTTCAGATGGTCGTTGACAAGGTTCTTTTAGGGGAGTAG 



>UGT85A7

ATCGAATCTCATGTTGTTCATAACGCACAAAAGCCACACGTAGTTTGCGTGCCTTACCCG 

GCTCAAGGCCACATCAATCCGATGCTGAAAGTGGCTAAACTCCTCTACGCTAAAGGCTTT 
CACGTCACCTTCGTTAACACTCTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGTCCC 
AACGCGCTCGACGGGTTTCCTTCATTCCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG 
ACTGATGGCGATAGGACGCAGCATACTCCTACCGTTTGCATGTCCATTGAGAAAAACTGT 
CTCGCTCCATTCAAAGAGATTCTGCGCCGGATCAACGATAAAGATGATGTTCCTCCAGTG 
AGTTGTATTGTATCGGACGGTGTGATGAGTTTTACTCTTGACGCAGCCGAGGAACTAGGT 
GTCCCAGAGGTTATTTTTTGGACCAATAGTGCTTGTGGTTTCATGACTATTCTACACTTT 
TATCTTTTCATCGAGAAGGGTCTATCTCCTTTTAAAGACGAAAGTTACATGTCAAAGGAG 
CATCTAGACACAGTTATAGATTGGATACCATCAATGAAGAATCTTAGGTTAAAGGACATC 
CCTAGCTATATACGTACCACAAATCCTGACAACATAATGCTTAATTTCCTCATTCGAGAA 
GTTGAGCGATCTAAACGCGCTAGTGCTATCATTCTCAACACGTTTGATGAACTCGAGCAT 
GATGTTATCCAGTCTATGCAATCTATTTTACCTCCGGTTTATTCTATTGGGCCACTCCAT 
CTCCTTGTGAAGGAAGAAATAAACGAGGCTAGTGAAATAGGACAGATGGGATTAAATTTG 
TGGAGAGAGGAGATGGAATGTTTGGATTGGCTCGATACAAAAACTCCAAACAGTGTTCTT 
TTTGTTAACTTTGGATGCATAACGGTGATGAGTGCAAAACAGCTTGAAGAATTTGCTTGG 
GGTTTGGCGGCAAGTAGGAAAGAGTTTTTATGGGTGATCCGTCCTAATTTAGTGGTGGGA
GAGGCGATGGTGGTTCTTCCACAAGAGTTTTTAGCGGAGACGATAGACCGGAGAATGTTA 
GCTAGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCCGCGATAGGAGGGTTCTTGACG 
CATTGCGGGTGGAACTCAACATTGGAGAGTCTCGCTGGTGGTGTTCCGATGATATGTTGG 
CCATGTTTTTCGGAGCAACCGACGAATTGTAAGTTTTGTTGTGATGAGTGGGGAGTGGGT
ATAGAGATTGGTAAAGATGTGAAGAGAGAGGAGGTCGAGACGGTGGTTAGAGAACTTATG 
GATGGAGAAAAGGGGAAAAAGCTGAGAGAAAAGGCGGAAGAGTGGCGGCGGTTGGCCGAG 
GAAGCGACGAGGTATAAACATGGTTCGTCGGTCATGAATCTTGAGACGCTTATACATAAA 
GTTTTCTTAGAAAATGTTAGATGA



>UGT86A1

ATGGAGAGAGCAAAGTCGAGGAAGCCTCATATCATGATGATACCATACCCACTTCAAGGT 
CACGTTATCCCTTTTGTCCACTTAGCCATCAAACTTGCTTCTCATGGCTTCACCATCACT 
TTCGTCAACACCGACTCCATCCACCACCACATCTCCACCGCTCACCAAGATGACGCCGGT 
GACATCTTCTCCGCCGCTCGCAGCTCCGGCCAGCACGACATACGTTACACCACCGTGAGC 
GACGGCTTCCCTTTAGACTTTGACCGGTCACTGAACCATGACCAGTTTTTCGAAGGCATT 
CTCCACGTCTTCTCTGCCCACGTGGATGATCTCATCGCCAAACTCTCCCGCCGTGATGAT 
CCTCCCGTGACTTGCTTGATCGCCGACACGTTTTATGTTTGGTCATCTATGATTTGCGAC 
AAGCACAACCTTGTAAATGTCTCGTTTTGGACCGAACCTGCCTTGGTCCTCAATCTCTAT 
TATCACATGGATGTCCTCATATCTAACGGTCATTTCAAATCTCTTGATAATCGTAAAGAC 
GTGATCGATTACGTACCAGGGGTTAAAGCAATAGAACCAAAGGACTTGATGTCATATCTT 
CAAGTAAGCGACAAAGACGTAGACACAAATACAGTAGTATACAGAATATTATTCAAGGCC 
TTTAAAGACGTCAAGAGAGCCGACTTCGTCGTATGCAACACGGTGCAAGAGCTCGAACCA

GACTCTCTCTCGGCTCTACAAGCCAAACAACCGGTTTACGCTATCGGTCCGGTTTTCTCA 

ACTGATTCGGTAGTTCCCACAAGCTTATGGGCCGAGTCAGACTGTACCGAGTGGCTTAAG

GGCCGGCCCACTGGGTCAGTTCTCTACGTCTCGTTTGGTAGCTATGCACATGTTGGTAAG 

AAGGAGATTGTTGAGATAGCTCATGGGCTTTTGCTTAGTGGGATTAGTTTCATTTGGGTT 

TTACGTCCGGATATAGTTGGATCCAACGTACCAGATTTTCTTCCAGCCGGGTTTGTGGAC 

(:yUVGCCCAAGATCGAGGTCTTGTGGTCCAATGGTGCTGCCAGATGGAAGTTATTTCAAAT 

CCGGCCGTGGGAGGGTTTTTCACACATTGTGGATGGAATTCAATTCTAGAGAGCGTTTGG 

TGTGGTTTGCCTTTGTTGTGTTATCCACTTTTGACAGATCAGTTCACGAATAGGAAGCTT 

GTGGTCGATGATTGGTGCATTGGGATTAATCTTTGTGAGAAGAAGACAATCACAAGGGAC 

OUlGTCTCAGCGAATGTTAAAAGATTGATGAATGGAGAAACTTCyAGTGAGCTAAG^^ 

AATGTTGAAAAGGTTAAACGTCATCTCAZ^GATGCGGTTACAACCGTTGGATCTTCGGAG 

ACGAATTTTAACTTGTTTGTTAGTGAGGTCCGAAATAGAATAGAAAGTAAATTGTGTAAT 

GTAAATGGACTAGAAATAAGTCCATCAAACTAA



>UGT86A2

ATGGCGGACGTTAGAAACCCTACAAAAAATCATCATGGTCATCATCATCTTCATGCTCTC 

TTGATCCCATATCCATTTCAAGGGCATGTAAACCCATTTGTACACTTAGCCATC^ 

GCGTCACAGGGGATCACCGTCACTTTCGTGAAa^CTCATTACATCCACCACCAG^^ 

AACGGCTCCGATGGAGATATTTTCGCTGGAGTTAGGTCAGAGTCTGGCCTTGACATAAGG 

TACGCGACGGTTTCCGATGGTTTACCGGTCGGATTTGACCGGTCGTTGAACCATGACACG 

TACCAATCGTCGCTGTTGCACGTGTTCTATiGCGCATGTGGAAGAGCTTGTC^ 

GTTGGAGGAGACGGCGGTGTGAATGTGATGATCGCCGACACATTCTTTGTTTGGCCGT 

GTGGTGGCTAGGAAGTTTGGTTTGGTTTGTGTCTCGTTTTGGACCGAAGCTGCTTTAOT 

TTTTCACTTTATTACCATATGGATCTGCTTCGGATTCATGGCCATTTTGGTGCTCAAGAA 

ACCCGCAGCGATCTAATCGACTACATTCCCGGAGTCGCCGCAATTi^CCCAAAAGACACG 

GCGTCGTATCTTCAAGAAACCGACACGTCATCAGTAGTTCATa^TCATCTTCAAA 

TTCGAAGACGTGAAAAAAGTCGATTTTGTACTCTGCAACACAATTCAGCM.TTCGAAGAC 

AAAACAATCAAAGCCCTAAACaLCAAAAATCCCATTTTACG 

TTGRATAACCAAACCGGTTCAGTCACAACCTaiCTCTGGTCTGAATCy^GATTGTAC^ 

TGGCTCAACACTAAACCAAAAAGCTCCGTACTTTATATCTCCTTTGGT^ 

GTCACAAAGAAGGATCTTGTTGAGATAGCTCACGGGATTTTGTTGAGTAAAGTTAATTTC

GTTTGGGTGGTGAGACCAGACATTGTTAGTTCAGACGAAACCAATCCATTACCAGAAGGG 

TTTGAAACAGAAGCTGGAGATCGTGGGATTGTAATACCATGGTGTTGTCAAATGACGGTT 

TTGTCACATGAGAGTGTTGGTGGGTTTTTGACACATTGTGGTTGGAACTCGATATTGGAG 

ACGATTTGGTGTGAGGTTCCTGTGTTGTGTTTTCCATTGTTGACTGATCAGGTTACGAAT 

AGGAAGCTTGTGGTTGATGATTGGGAGATTGGGATTAATCTTTGTGAAGATAAGAGTGAT 

TTTGGTAGAGATGAAGTTGGGAGGAATATTAACCGTTTGATGTGTGGTGTTTCGAAAGAG 

AAGATCG6ACG6GTTAJyUVTGAGTTTGGAAGGTGCGGTGAGAAACA.GT 

gagatgaatttAggtttgtttattgatggacttttgtctaa^ 
aaagcttaa



>UGT87A1

ATGAATCCAATCAAACCTCAGCCACTCGGAGTCCGCCACGTGGTGGCCATGCCTTGGCCA 
GGAAGAGGCCACATCAACCCAATGTTAAACCTCTGCAAAAGCCTCGTCCGGCGAGACCCA 
AACCTCACCGTCACATTCGTCGTCACCGAAGAATGGCTCGGGTTCATCGGGTCCGACCCG 
AAACCTAACCGGATCCATTTCGCCACTCTCCCCAACATCATTCCCTCCGAGCTCGTCCGA 
GCCAACGACTTCATCGCCTTCATCGACGCCGTCCTCACCAGATTAGAAGAGCCGTTCGAA 
CAGCTACTTGACCGTCTAAACTCTCCTCCCACCGCAATCATCGCCGATACTTACATCATT 
TGGGCAGTACGTGTAGGCACAAAAAGGAATATTCCGGTGGCTTCTTTCTGGACTACGTCA 
GCCACGATTCTCTCCCTCTTCATTAACTCCGATCTTCTCGCAAGTCACGGCCATTTTCCG 
ATCGAACCATCAGAATCAAAACTAGACGAGATTGTTGATTACATCCCCGGTTTATCTCCG 
ACAAGACTCAGTGACTTACAGATCTTACACGGCTATAGTCATCAAGTC^^ 

AAAAAGTCTTTCGGTGAGCTTTATAAAGCTAAGTATCTTCTCTTCCCTTCTGCTTATGAG 

CTCGAACCAAAAGCCATTGACTTTTTCACTTCCAAGTTTGATTTCCCGGTTTACTCCACT 

GGTCCGTTAATACCCTTGGAAGAACTATCCGTTGGAAATGAGAATAGAGAACTTGATTAC 

TTTAAGTGGCTTGATGAGCAACCTGAAA6CTCTGTTCTTTACATATCTCAAGGGAGTTTT 

CTTTCAGTCTCCGAAGCTCAGATGGAGGAGATTGTTGTAGGAGTTAGAGAGGCTGGAGTT 

AAGTTCTTTTGGGTGGCTCGTGGGGGTGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGC 

TTGGGTGTTGTGGTGAGCTGGTGTGATCAGCTACGTGTTTTGTGTCATGCGGCTATAGGC 

GGGTTTTGGACGCATTGCGGGTATAACTCGACATTGGAAGGGATATGTTCGGGAGTACCG 

TTGCTTACATTTCCTGTTTTTTGGGATCAGTTTCTGAATGCTAAGATGATTGTTGAGGAG 

TGGAGAGTTGGT^TGGGGATCGAGAGGAAGAAGCAGATGGAGTTGTTGATAGTGAGTGAT 

GAGATCAAGGAATTGGTAAAAAGGTTTATGGATGGAGAGAGTGAAGAAGGGAAAGAGATG 

AGAAGAAGGACTTGTGATCTCAGTGAGATATGTCQTGGAGCGGTTGCGAAAGGTGGTTCT 

TCTGATGCTAACATCGATGCTTTCATTAAAGATATTACTAAGATCGTGTGA



>UGT87A2

ATGGATCCAAATGAATCTCCACCy^CCAATTTCGCCACGTGGTGGCCATGCCTTATCCA 

GGTCGAGGACACATCAACCCTATGATGAACCTCTGCAAACGCCTTGTCCGTCGATACCCT 

AACCTTCACGTCACCTTCGTCGTCACAGAAGAATGGCTCGGGTTTATTGGACCCGACCCG 

AAACCCGACCGGATCCATTTCTCCACTCTCCCTAATCTCATCCCTTCCGAGCTTGTCAGG 

GCCAAAGACTTCATAGGCTTCATTGATGCCGTCTACACAAGATTGGAAGAACCAT^^ 

AAGCTTCTTGACAGCCTCAATTCACCACCTCCGAGTGTAATATTCGCCGACACTTACGTC 

ATTTG6GCTGTGCGAGTCGGCAGAAAAAGGAATATTCCGGTGGTTTCTCTCTGGACCATG 

TCAGCCACGATTCTCTCCTTCTTCCTCCACTCTGATCTACTCATAAGTCATGGCCATGCT 

CTGTTCGAACCATCAGAAGAAGAGGTTGTTGATTACGTCCCCGGTTTATCTCCGACGAAA 

CTCCGAGATTTGCCGCCGATATTTGACGGTTACAGCGACCGAGTCTTCAAGACAGCTAAG 

TTGTGTTTCGATGAACTACCAGGAGCTAGGTCTTTACTCTTCACCACCGCCTATGAGCTT 

GAACACAAAGCTATTGACGCTTTCACCTCCAAGCTCGATATCCCGGTCTACGCTATTGGT 

CCTTTAATACCTTTTGAAGAACTTTCTGTTCAAAATGATAACAAGGAACCTAATTACATC 

CA6TGGCTTGAGGAACAACCGGAAGGCTCTGTTCTTTACATATCTCAGGGAAGTTTTCTT 

TCGGTCTCGGAAGCTCAGATGGAGGAAATAGTGAAAGGACTGAGAGAAAGTGGAGTCCGG 

TTTCTTTGGGTGGCTCGTGGGGGCGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGCTTA 

GGTGTAGTGGTGAGCTGGTGTGATCAGCTTCGGGTGCTGTGTCACAZ^GCTGTAGGCGGG 

TTTTGGACTCATTGCGGGTTTAACTCGACATTGGAAGGGATATATTCAGGAGTACCAATG 

CTAGCGTTTCCGTTGTTTTGGGATCAGATTCTGAACGCTAAGATGATTGTTGAGGACTGG 

AGAGTCGGAATGAGGATCGAGAGGACGAAAAAGAATGAGTTGTTGATAGGGAGAGAGGAG 

ATCAAGGAAGTAGTGAAGAGGTTTATGGATAGAGAGAGTGAAGAAGGGAAAGAGATGAGA 

AGAAGGGCTTGTGACCTTAGTGAAATCAGTCGAGGAGCTGTTGCGAAAAGCGGTTCGTCT 

AATGTAAACATCGATGAGTTCGTTCGGCATATTACCAATACAAATTAA 



>UGT88A1

ATGGGTGAAGAAGCTATAGTTCTGTATCCTGCACCACCAATAGGTCACTTAGTGTCCATG 

GTTGAGTTAGGTAAAACCy^TCCTCTCCAAAAACCCATCTCTCTCCATCCACATTATCTTA 

GTTCCACCGCCTTATCAGCCGGAATCAACCGCCACTTACATCTCCTCCGTCTCCTCCTCC 

TTCCCTTCAATAACCTTCCACCATCTTCCCGCCGTCACACCGTACTCCTCCTCCTCCACC 

TCTCGCCACCACCACGAATCTCTCCTCCTAGAGATCCTCTGTTTTAGCAACCCAAGTGTC 

CACCGAACTCTTTTCTCACTCTCTCGGAATTTCAATGTCCGAGCAATGATCATCGATTTC 

TTCTGCACCGCCGTTTTAGACATCACCGCTGACTTCACGTTCCCGGTTTACTTCTTCTAC 

ACCTCTGGAGCCGCATGTCTCGCCTTTTCCTTCTATCTCCCGACCATCGACGAAACAACC 

CCCGGAAAAAACCTO^AAGACy^TTCCTACAGTTCATATCCCCGGCGTTCCTCCGATGA^ 

GGCTCCGATATGCCTAAGGCGGTGCTCGAACGAGACGATGAGGTCTACGATGTTTTTATA 

ATGTTCGGTAAACAGCTCTCGAAGTCGTCAGGGATTATTATCAATACGTTTGATGCTTTA 
GAAAACAGAGCCATCAAGGCCATAACAGAGGAGCTCTGTTTTCGCAATATTTATCCAATT 
GGACCGCTCATTGTAAACGGAAGAATCGAAGATAGAAACGACAACAAGGCAGTTTCTTGT 
CTCAATTGGCTGGATTCGCAGCCGGAAAAGAGTGTTGTGTTTCTCT6TTTTGGAAGCTTA 
GGTTTGTTCTCAAAAGAACAGGTGATAGAGATTGCTGTTGGTTTAGAGAAAAGTGGGCAG 
AGATTCTTGTGGGTG6TCCGTAATCCACCCGAGTTAGAAAAGACAGAACTGGATTTGAAA 
TCACTCTTACCAGAAGGATTCTTAAGCCGAACCGAAGACAAAGGGATGGTCGTGAAATCA 
TGGGCTCCGCAAGTTCCGGTTCTGAATCATAAAGCAGTCGGGGGATTCGTCACTCATTGC 
GGTTGGAATTCAATTCTTGAAGCTGTTTGTGCTGGTGTGCCGATGGTGGCTTGGCCGTTG 
TACGCTGAGCAGAGGTTTAATAGAGTGATGATTGTGGATGAGATCAAGATTGCGATTTCG 
ATGAATGAATCAGAGACGGGTTTCGTGAGCTCTACAGAGGTGGAGAAACGAGTCCAAGAG 
ATAATTGGGGAGTGTCCGGTTAGGGAGCGAACCATGGCTATGAAGAACGCAGCCGAATTA 
GCCTTGACAGAAACTGGTTCGTCTCATACCGCATTAACTACTTTACTCCAGTCGTGGAGC 
CCAAAGTGA



>UGT89A2 

ATGACGGAAGTGTTATTGTTGCCGGGAACTAAATCGGAGAATTCAAAACCACCGCACATA 

GTGGTGTTTCCATTCCCAGCACAAGGCCACTTACTTCCTCTACTTGACTTAACTCACCAA 

CTCTGCCTCCGTGGATTCAACGTCTCCGTCATCGTTACTCCCGGTAACCTTACTTACCTC 

TCTCCTCTTCTCTCCGCTCATCCCTCCTCCGTCACCTCCGTCGTTTTCCCTTTCCCTCCT 

CATCCTTCACTCTCTCCCGGCGTCGAAAACGTTAAAGACGTCGGAAATTCAGGAAATCTC 

CCGATCATGGCTTCTCTTCGTCAGCTACGAGAACCAATCATCAACTGGTTCC^ 

CCGAATCCGCCTATCGCTCTCATCTCCGATTTCTTCCTCGGATGGACTCACGATCTCTGC 

AATCAAATCGGTATCCCCAGATTCGCTTTCTTCTCCATCAGCTTCTTCTTAGTTTCCGTT 

CTTCAATTTTGCTTCGAGAACATCGATCTAATCAAATCAACGGATCC6ATTCATCTCCTT 

GATCTTCCTCGCGCTCCGATTTTCAAAGAAGAGCATCTTCCGTCTATAGTCCGACGAAGT 

CTCCAAACTCCGTCACCGGATCTCGAATCAATCAAAGATTTCTCCATGAATTTGTTG^^ 

TACGGATCTGTTTTCAA.TTCTTCTGAGATTCTGGAAGATGATTATCTT(aLGTACGTGAA^ 

CAGAGGATGGGTCATGATCGGGTTTATGTTATTGGCCCGCTTTGTTCAATCGGGTCGGGT 

CTTAAATCGAATTCGGGTTCTGTAGACCCGAGTTTGCTGAGTTGGTTAGACGGATCCCCA 

AACGGGTCAGTTCTATACGTTTGTTTCGGAAGTCAAAAGGCGTTGACTAAAGACCAGTGT 

GATGCTTTGGCTCTAGGCTTAGAGAAAAGCATGACCCGGTTTGTTTGGGTGGTTAAGAAA 

GATCCGATACCCGACGGGTTTGAGGATCGGGTTTCCGGAAGGGGATTGGTGGTAAGAGGA 

TGGGTCTCCCAGCTGGCGGTGTTGCGACACGTGGCGGTTGGTGGATTTTTGAGCCATTGT 

6GATG6AACTCAGTGCTTGAAGGGATAACGAGTGGGGCT6TGATCTTGGGCTGGCCCATG 

GAGGCGGACCAGTTTGTGAACGCGAGGTTGCTTGTGGAGCATTTGGGTGTTGCGGTTAGG 

GTTTGCGAAGGTGGTGAAACTGTGCCTGACTCGGATGAGTTGGGTCGGGTCATAGCGGAA 

ACGATGGGTGAGGGAGGACGCGAGGTGGCTGCTCGGGCTGAGGAGATACGGCGGAAGACC 

GAGGCTGCCGTGACGGAGGCyy^TGGAAGCTCCGTTGAAAATGTACAAAGACTTGTCa^ 

6AATTTGAAAAAGTCTAA



>UGT89B1

ATGAAAGTGAACGAGGAAAACAACAAGCCGACAAAGACCCATGTCTTAATCTTCCCATTT 

CCGGCGCAAGGTCACATGATTCCCCTCCTCGACTTCACCCACCGCCTTGCTCTCCGCGGC 

GGCGCCGCCTTAAAiy^TAACCGTCCTAGTCACTCaU^AAAACCTTCCTTTT 

CTTCTCTCCGCCGTAGTTAACATCGAACCACTTATCCTCCCTTTTCCCTCCCACCCTTCA 

ATCCCCTCCGGCGTCGAAAACGTCCAAGACTTACCTCCTTCAGGCTTCCCTTTAATGATC 

CACGCGCTTGGTAATCTCCACGCGCCGCTTATCTCTTGGATTACTTCTCACCCTTCTCCT 

CCAGTAGCCATCGTATCTGATTTCTTCCTTGGTTGGACCAAAAACCTCGGAATCCCTCGT 

TTCGATTTCTCTCCCTCCGCTGCTATCACTTGCTGCATACTCAATACTCTCTGGATCGAA 

ATGCCCa^CCAAGATCAACGAAGATGACGATAACGAGATCCTCCACTTTCCCAAGATCCCG 

AATTGTCCAAAATACCGTTTTGATCAGATCTCCTCTCTTTACAGAAGTTACGTTCACGGA 

GATCCAGCTTGGGAGTTCATAAGAGACTCCTTTAGAGATAACGTGGCGAGTTGGGGACTC 
GTCGTGAACTCGTTCACCGCCATGGAAGGTGTTTATCTCGAACATCTTAAGCGAGAGATG 

G6CCATGATCGTGTATGGGCTGTAGGCCCAATTATTCCGTTATCTGGGGATAACCGTGGT 

GGCCCGACTTCTGTTTCTGTTGATCACGTGATGTCGTGGCTTGACGCACGTGAGGATAAC 

CACGTGGTGTACGTGTGCTTTGGAAGTCAAGTAGTTTTGACTAAAGAGCAGACTCTTGCA 

CTCGCCTCTGGGCTTGAGAAAAGCGGCGTCCATTTCATATGGGCCGTAAAGGAGCCC6TT 

GAGAAAGACTCAACACGTGGCAACATCCTGGACGGTTTCGACGATCGCGTGGCTGGGAGA 

GGTCTGGTGATCAGAGGATGGGCTCCACAAGTAGCTGTGCTACGTCACCGAGCCGTTGGC 

GCGTTTTTAACGCACTGTGGTTGGAACTCTGTGGTGGAGGCGGTTGTCGCCGGC6TTTTG 

ATGCTGACGTGGCCGATGAGAGCTGACCAGTACACTGACGCGTCTCTGGTGGTTGATGAG 

TTGAAAGTAGGTGTGCGTGCTTGCGAAGGACCTGACACGGTGCCTGACCCGGACGAGTTA 

GCTCGAGTTTTCGCTGATTCCGTGACCGGAAATCT^CGGAGAGGATCAAAGCCGTGGAG 

CTGAGGAAAGCAGCGTTGGATGCGATTCAAGAACGTGGGAGCTGAGTGAATGATTTA^ 

GGATTTATCCAACATGTCGTTAGTTTAGGACTAAACAAATGA



>UGT89C1

ATGACAACAACAACAACGAAGAAGCCGCACGTTCTGGTGATACCGTTTCCAC^ 

Cy^GATGGTTCCACaiTCTTGACCTCACGCATCAGATTCTTCTCCGTGGAGCCACCGTCACT 

GTCCTCGTCACACCCAAATVACTCTTCCTATCTCGATGCTCTCCGTTCTCTTCACTCCCCG 

GAACACTTCAAAACCCTAATCCTTCCTTTTCCTTCTCACCCTTGTATACCTTCCGGTGT 

GAATCTCTCCAGCAACTTCCTCTCGAAGCTATAGTTCACATGTTTGATGCTCTCTCTCGT 

CTCCACGACCCTCTCGTTGACTTTCTCAGCCGTCAACCACCGTCGGATCTCCCCGACGCC 

ATCCTAGGAAGCTCATTTCTCAGCCCTTGGATTAACAAAGTAGCTGATGCTTTCTCTATT 

AAGTCCATTAGTTTCTTACCCATCAATGCTCATTCGATCTCCGTCATGTGGGCTCAAGAA 

GATAGAAGCTTCTTCAACGATCTCGAGAGTGCCACAACGGAAAGCTACGiGGCTCGTCATC 

AACAGTTTCTACGACCTCGAGCCTGAGTTTGTAGAAACTGTTAAAACACGTTTCCTGAAT 

CACCACCGTATATGGACCGTCGGACCGTTGCTCCCCTTTAAAGCTGGCGTTGACCGTGGC 

GGACa^GCTCAATCCCGCCGGCGAAAGTCTCGGCTTGGTTAGATTCGTGCCCCGAGG^ 

AACTCCGTCGTATACGTCGGTTTTGGAAGCCAGATCCGGCTCACGGCGGAGCAAACAGCT 

GCTTTAGCGGCGGCGTTGGAGAAAAGCAGTGTGCGTTTCATATGGGCGGTGAGAGACGCA 

GCTAAGAAGGTGAACTCCAGCGATAACTCCGTTGAGGAAGATGTGATCCCGGCGGGATTT 

GAA6AGAGAGTGAAGGAGAAAGGACTCGTGATAAGAGGATGGGCCCCACAAACTATGATT 

CTTGAGCATCGAGCCGTTGGATCTTACCTAACTCATTTGGGTTGGGGTTCGGTTCTGGAA 

GGAATGGTCGGAGGAGTTATGTTGCTAGCGTGGCCGATGCAAGCAGACCATTTCTTTAAC 

ACGACGCTCATCGTTGATAAACTAAGAGCCGCAGTGCGAGTTGGAGAGAACAGAGACTCG 

GTTCCTGACTCGGACAAGCTCGCTAGGATTTTGGCTGAGTCGGCGAGAGAGGACTTGCCG 

GAGAGAGTTACGTTGATGAAGCTGAGGGAGAAAGCTATGGAGGCCATTAAAGAAGGTGGG 

AGCTCTTACAAGAACTTGGATGAGCTCGTTGCAGAGATGTGTTTGTAA 



>UGT90A1

ATGTCCGTTTCAACACATCACCACCACGTGGTCCTCTTCCCTTTCATGTCAAAAGG^ 

ATCaiTCCCTCTCCTCCAATTCGGTCGTCTCCTCCTCCGTCACCACCGCAAAGAACCAACC 

ATCACCGTCACCGTTTTCACCACTCCCAAGAACCAACCTTTCATCTCAGACtTCCTCTCG 

GATACGCCGGAGATCAAAGTCATCTCTCTCCCTTTCCCGGAAAACATCACCGGAATCCCT 

CCCGGCGTCGAGAACACCGAAAAGCTCCCATCCATGTCACTTTTCGTCCCCTTCACACGC 

GCCACGAAGCTTCTCCAACCTTTCTTCGAAGAAACACTCAAGACTCTTCCAAAAGTTTCG 

TTCATGGTCTCTGATGGATTCCTCTGGTGGACATCGGAGTCTGCAGCTAAGTTCAACATT 

CCAAGATTTGTCTCCTACGGCATGAACTCTTACTCCGCCGCTGTCTCCATCTCTGTTTTC 

AAACACGAACTCTTTACCGAACCGGAAAGTAAATCTGATACCGAACCGGTCACTGTACCA 

GACTTTCCATGGATCAAGGTCAAGAAGTGTGATTTCGACCATGGCACTACCGAGCCGGAA 

GAATCAGGTGCAGCCCTCGAACTATCTATGGACCAAATCAAGTCGACCACCACAAGCCAT 

GGGTTTTTAGTCAATAGCTTCTACGAGCTCGAGTCAGCATTTGTTGATTACAACAACAAC 

TCTGGTGATAAACCAAAGTCGTGGTGTGTTGGGCCACTGTGTTTGACAGATCCTCCTAAA 
CAGGGGAGTGCTAAACCGGCTTGGATTCATTGGTTGGATCAGAAGCGAGAGGAAGGGCGT 
CCGGTTTTGTACGTGGCGTTTGGAACGCAGGCAGAGATATCGAACAAGCAGCTTATGGAA 
CTAGCTTTCGGCTTGGAAGATTCAAAGGTGAACTTTCTGTGGGTCACAAGAAAAGATGTG 
GAGGAGATTATTGGAGAAGGATTCAACGATAGAATAAGAGAGAGTGGGATGATAGTGAGA 
GATTGGGTGGACCAATGGGAGATATTGTCACATGAAAGTGTCAAAGGATTTTTGAGCCAT 
TGTGGGTGGAACTCAGCACAAGAGAGCATATGTGTCGGGGTCCCATTGTTGGCTTGGCCG 
ATGATGGCCGAGCAACCGCTCAATGCGAAGATGGTTGTGGAGGAGATAAAGGTGGGAGTA 
A6AGTTGAAACGGAAGATGGGAGTGTAAAAGGTTTTGTGACAAGAGAAGAACTAA6TGGA 
AAGATTi^GAACTGATGGAAGGAGAAACGGGGAAAACCGCAAGAAAGAATGTAAAAGA^ 
TATTCGAAAATGGCGAAAGCGGCTTTGGTCGAAGGGACTGGTTCGTCATGGAAGAATTTA 
GATATGATTCTTAAGGAGTTATGTAAGAGTAGAGATTCAAACGGTGCTAGTGAGTAG 



>UGT90A2

ATGGAGTTAGAAAAAGTTCACGTGGTTTTGTTCCCATACTTGTCCAAAGGGCACATGATT 

CCTATGCTCCAATTAGCTCGTCTCCTCTTATCCCACTCCTTCGCCGGAGACATCTCCGTC 

ACCGTCTTCACCACTCCTTTGAACCGTCCTTTCATCGTTGACTCACTCTCCGGCACCAAA 

GCGACCATCGTCGACGTACCTTTCCCTGATAACGTCCCGGAGATCCCACCCGGCGTCGAG 

TGCACTGACAAACTCCCT6CTTTGTCGTCCTCCCTCTTCGTTCCTTTCACAAGAGCCACC 

AAGTCAATGCAGGCAGACTTTGAGCGAGAGCTCATGTCACTGCCACGTGTCAGTTTCATG 

GTCTCAGACGGTTTCTTGTGGTGGACGCAAGAGTCAGCTCGAAAGCTAGGGTTTCCTCGG 

CTTGTTTTCTTTGGTATGAATTGCGCTTCCACCGTTATATGTGACAGTGTTTTTCAAAAC 

CAGCTTCTATCTAATGTTAAGTCCGAGACGGAGCCAGTTTCTGTACCGGAGTTTCCGTGG 

ATTAAGGTTAGGAAATGTGATTTCGTTAAAGATATGTTTGATCCAAAAACCACCACAGAT 

CCTGOATTCT^GCTTATCCTAGATO^GTCACGTCTATGAATO^GC^ 

TTCAATACATTTGACGACCTTGAACCCGTGTTTATTGATTTCTACAAGCGTAAACGCAAA 

CTCAAGCTTTGGGCAGTTGGACCGCTTTGTTACGTAAATAACTTGGCTTGGATGATGAAG 

TAGAAGAGAAGGTCAAAdCTAGTTGGATGAAATGGCTAGATGAAAAG^ 

GG?^TGTTCTGTATGTGGCTTTCGGGTCACAAGCCGAGATCTCGAGAGAACAACTAGAGG 

AGATTGCGTTAGGGTTGGAAGAATCGAAGGTGAACTTCTTGTGGGTGGTCAAAGGAAATG 

AAATAGGAAAAGGGTTTGAAGAGAGAGTGGGAGAAAGAGGAATGATGGTGAGA6ATGAAT 

GGGTTGATCAGAGGAAGATATTAGAGCACGAGAGTGTTAGAGGGTTCTTGAGCCATTGTG 

GGTGGAATTCTCTGACGGAGAGCATTTGCTCGGAGGTTCCAATCTTGGCGTTTCCTTTAG 

CAGCGGAGCAACCTCTGAATGCGATTTTGGTGGTGGAAGAGCTGAGAGTGGCGGAGAGAG 

TGGTGGCGGCGAGTGAAGGGGTTGTGAGAAGAGAAGAGATTGCAGAGAAAGTGAAGGAGT 

TGATGGAGGGAGAGAAAGGGAAAGAGCTGAGGAGGAATGTCGAGGCATATGGTAAGATGG 

CGAAGAAGGCTTTGGAGGAAGGTATTGGTTCGTCTAGGAAGAATTTAGACAACCTTATCA 

ACGAGTTTTGTAACAATGGAACATGA



>UGT90A4

ATGGCCGTTTCATCGTCGCATCATGCGGTTCTCTTCCCTTACATGTCAAAAGGCCACACG 

ATTCCTCTCCTCCAATTCGCCCGTCTCCTCCTCCGTCACCGCCGTATCGTCTCCGTAGAC 

GACGAAGAACCAACCATTTCCGTO^CCGTCTTCACCACCCCAAAAAACGAAC^ 

TCAAACTTCCTCTCTGACGTCGCATCATCTATCAAAGTAATCTCCCTCCCTTTCCCTGAA 

AACATCGCCGGAATCCCTCCCGGCGTCGAGAGCACCGACATGCTCCCTTCCATATCACTT 

TACGTGCCCTTCACGCGCGCAACCAAATCTCTCCAGCCTTTCTTCGAAGCAGAACTCAAG 

AATCTTGAGAAAGTTTCTTTCATGGTCTCCGATGGATTCTTATGGTGGACATCGGAATCC 

GCCGCTAAATTTGAGATCCCGAGACTTGCCTTCTACGGCATGAACTCCTACGCATCGGCT 

ATGTGCTCCGCCATTTCGGTACACGAGCTCTTTACCAAACCGGAAAGTGTTAAATCTGAT 

ACTGAACCGGTTACTGTACCGGATTTTCCATGGATATGTGTTAAGAAGTGTGAGTTCGAT 

CCGGTTTTGACCGAACCGGATCAATCGGATCCAGCGTTCGAGCTACTCATTGACCATCTT 

ATGTCCACCAAGAAAAGCCGTGGAGTTATAGTGAACAGCTTTTACGAGCTCGAGTCAACG 

TTCGTTGACTACCGGCTCCGTGATAACGATGAACCAAAACCGTGGTGTGTTGGGCCTTTG 






TGTTTGGTAAATCCTCCAAAACCGGAGAGTGATAAACCGGATTGGATTCATTGGTTGGAC 
CGGAAACTAGAGGAAAGATGTCCGGTTATGTATGTGGCGTTTGGAACGCAGGCTGAGATA 
TCGAACGAGCAGCTCAAGGAAATAGCATTAGGGTTGGAAGATTCCAAGGTCAATTTCTTG 
TGGGTCACGAGAAAGGACTTGGAAGAAGTAACTGGAGGATTAGGGTTC6AAAAGAGAGTG 
AAAGAGCATGGGATGATTGTGAGAGATTGGGTAGACCAATGGGAGATATTGTCACATAAA 
AGTGTCAAAGGGTTTTTGAGTCATTGTGGATGGAACTCGGCGCAAGAGAGTATTTGCGCT 
GGGGTTCCACTACTCGCTTGGCCAATGATGGCAGAGCAGCCACTCAATGCGAAGTTGGTA 
GTGGAGGAGCTAAAGATCGGAGTAAGAATCGAAACAGAAGATGTAAGTGTGAAAGGATTC 
GTGACAAGAGAAGAACTTAGTCGAAAGGTTAAACAATTGATGGAGGGAGAGATGGGGAAG 
ACAACGATGAAGAATGTAAAAGAGTATGCGAAAATGGCGAAAAAAGCTATGGCTCAAGGG 
ACK3GTTCGTCTTGGAAGAGTTTGGATTCGCTTCTGGAAGAGCTTTGTAAGAGTAGAGAG 
CCAGACGGTGTTAATAAGTTGTCAAGTTCTGATGCTTAG 



>UGT91A1

ATGACAAACTTCAAAGACAACGATGGAGATGGAACCAAACTCCACGTGGTAATGTTTCCA 

TGGTTAGCCTTTGGTCACATGGTTCCATACTTGGAGCTCTCTAAACTCATAGCTCAAAAG 

GGTCACAAAGTCTCTTTCATTTCCACTCCACGTAACATCGACCGTCTCCTCCCATGGTTA 

CCGGAAAATCTCTCCTCCGTCATTAACTTCGTCAAGCTATCACTTCCCGTCGGCGACAAC 

AAACTCCCGGAAGACGGTGAAGCTACCACAGACGTCCCTTTCGAACTCATACCTTACTTA 

AAAATCGQfTTACGACGGGTTAAAAGTTCCGGTGACGGAGTTTCTTGAATCTTCGAAACCC 

GATTGGGTTCTTCAAGATTTCGCGGGGTTTTGGCTTCCTCCAATCTCTCGTCGTCTCGGA 

ATCAAAACCGGATTCTTTAGCGCTTTCAACGGCGCGACGCTCGGTATTCTTAAACCGCCG 

GGGTTCGAAGAGTACCGTACTTCGCCGGCGGATTTTATGAAGCCGCCTAAGTGGGTTCCG 

.TTTGAAACTTC(k3TAGCTTTCAAGTTATTTGAATGCyiGGTTCa.TTTTCA^ 

GCGGAAACCACCGAAGGGAATGTTCCCGACATCCACCGTGTCGGCGGCGTAATTGACGGC 

TGTGACGTCATCTTCGTACGGAGCTGTTACGAGTATGAAGCGGAGTGGTTAGGACTTACA 

CAAGAACTTCACCGGAAACCGGTTATACCGGTCGGAGTTTTGCCTCCAAAACCGGACGAA 

AAGTTTGAAGATACCGACACGTGGCTGTCTGTTAAAAAATGGTTGGACTCACGGAAAAGT 

AAGTCCATTGTCTACGTAGCTTTTGGTTCAGAAGCTAAACCGAGTCAAACGGAGCTAAAT 

GAGATC6CTCTCGGTTTAGAGCTTTCTGGTTTACCTTTCTTTTGGGTGTTAAAGACTCGT 

CGTGGTCCGTGGGATACCGAACCGGTCGAGCTTCCGGAAGGATTCGAAGAGCGTACAGCG 

GATAGAGGGATGGTGTGGAGAGGTTGGGTTGAGCAATTGCGTACATTGAGCCATGACTCG 

ATCGGTTTGGTTCTGACTCATCCCGGTTGGGGAACGATAATTGAAGCTATCCGGTTTGCT 

AAACCGATGGCAATGCTGGTTTTTGTGTATGACCAAGGATTGAATGCGAGAGTCATTGAA 

GAGAAGAAAATTGGGTATATGATCCCTCGAGACGAGACAGAAGGTTTCTTTACTAAAGAA 

AGTGTTGCGAATTCGCTAAGATTGGTAATGGTGGAAGAAGAAGGAAAGGTTTATAGAGAG 

AATGTGAAGGAGATGAAAGGAGTGTTTGGA6ATATGGATAGACAAGATCGTTATGTGGAT 

TCATTCTTGGAATATCTTGTTACTAATCGTTAA



>UGT91B1

ATGGCCGAGCCAAAACCGAAGCTTCATGTTGCAGTGTTCCCATGGTTAGCTTTAGGTCAC 

ATGATTCCTTACTTGCAACTCTCAAAGCTCATAGCAAGGAAAGGCCATACTGTGTCCTTC 

ATCTCCACAGCTCGTAACATTTCACGTCTTCCCAATATATCCTCCGACCTTTCCGTGAAT 

TTCGTTTCTTTGCCGTTAAGTCAAACCGTCGACCATCTCCCAGAGAACGCTGAGGCCACC 

ACTGATGTCCCGGAGACTCACATAGCTTATCTGAAGAAAGCATTTGATGGGCTTTCTGZ^ 

GCTTTCACAGAGTTTTTAGAAGCTTCCAAACCAAACTGGATAGTGTATGATATCTTGCAC 

CATTGGGTCCCGCCTATCGCTGAGAAGCTCGGCGTGAGACGAGCCATCTTCTGCACGTTC 

AACGCAGCTTCCATCATCATCATCGGTGGGCCAGCATCAGTCATGATTCAAGGTCATGAC 

CCTCGAAAGACTGCTGAAGATCTTATCGTGCCTCCACCATGGGTCCCGTTTGAGACCAAC 

ATAGTTTACCGTCTCTTTGAAGCTAAGAGGATCATGGAGTATCCCACGGCAGGTGTAACT 

GGAGTTGAATTGAACGACAACTGTAGATTGGGTTTGGCTTACGTTGGCTCTGAGGTTATT 

GTGATTAGATCATGTATGGAACTCGAACCTGAGTGGATTCAATTGCTCAGTAAACTCCAA 
GGAAAGCCTGTGATTCCAATTGGTTTACTCCCGMSCTACACOyVTGGATGATGCaVGA 

GAGGGAACATGGTTAGACATCAGAGAATGGCTAGACAGACATCAAGCAAAGTCTGTGGTT 

TATGTAGCCTTAGGAACTGAAGTGACAATTAGTAACGAAGAGATTCAAGGTTTAGCTCAT 

GGGTTGGAGCTTTGCAGGTTACCTTTCTTTTGGACGCTAAGGAAGAGGACTAGAGCTTCT 

ATGCTACTACCTGAT6GGTTCAAAGAGAGAGTCAAAGAGCGTGGAGTCATTTGGACCGAG 

TGGGTACCTCAGACCAAGATACTGAGCCATGGTTCAGTTGGTGGGTTTGTTACTCATTGT 

GGTTGGGGATCAGCTGTGGAAGGGCTTAGCTTTGGTGTCCCTTTGATCATGTTTCCATGT 

AACCTAGACCAGCCGCTAGTGGCTAGGTTGCTCAGTGGQATGAATATAGGCTTGGA 

CCAAGGAATGAGCGAGACGGGCTGTTCACGAGTGCTTCTGTTGCAGAGAG?^TCAGACAT 

GTTGTTGTGGAAGAAGAAGGAAAGATCTAGAGGAACAATGCTGCATCTCAGCAJ^ 

ATATTCGGGAACAAGAGATTGCAAGATCAGTATGCGGATGGTTTTATCGAGTTTCTGGAG 

AATCCTATAGCAG6AGT6TAG



>UGT91C1

ATGGTCGACAAGAGAGAAGAAGTTATGCACGTAGCCATGTTTCCATGGCTAGCTATGGGT 

CATCTCCTTCCTTTTCTTCGTCTCTCCAAGTTACTAGCTCAAAAGGGTCACAAGATCTCT 

TTCATATCAACACCAAGAAACATCGAAAGACTTCCTAAATTACAATC^^ 

TCCATCACCTTCGTCTCTTTCCCTCTCCCTCCCATCTCAGGCTTGCCTCCTTCTTCAGAA 

TCATCCATGGACGTTCCTTACAACAAGCAACAGTCTCTTAAAGCCGCTTTTGATCTTCTT 

CAGCCACCGTTGAAAGAGTTTCTCCGACGGTCTTCTCCGGATTGGATCATATACGACTAT 

GCTTCTCACTGGCTTCCTTCTATTGCGGCCGAGCTTGGAATCTCTAAGGCTTTCTTTAGT 

CTCTTTAACGCAGCTACTCTCTGTTTCATGGGACCGTCTTCGTCTTTGATTGAAGAAATT 

AGATCAACGCCGGAAGATTTCACGGTGGTGCCACCGTGGGTCCCGTTCAAGTCAAACATC 

GTGTTTCGTTATCATGAAGTTACTAGATACGTTGAGAAGACAGAGGAAGATGTAACCGGA 

GTCTCTGACTCAGTTCGGTTTGGTTACTCGATTGACGAAAGCGATGCGGTTTTTGTCCGT 

AGCTGTCCGGAGTTTGAACCGGAATGGTTTGGTTTACTAAAAGACCTGTACCGTAAACCG 

GTATTTCCAATCGGGTTTTTGCCTCCGGTTATTGAAGACGAC6ATGCCGTTGATACTACA 

TGGGTTCGTATA?^GAAGTGGCTCGACAAGCAACGGCTTAATTCAGTTGTTTACGTGT(^ 

CTTGGCACCGAAGCGAGTCTTCGTCATGAGGAAGTAACTGAGCTAGCTCTTGGGTTAGAG 

AAGTCAGAGACACCGTTCTTTTGGGTCCTAAGGAACGAGCCAAAGATTCCAGATGGGTTC 

AAAACACGAGTCAAGGGACGTGGAATGGTTCATGTTGGTTGGGTTCCACAAGTGAAAATA 

CTTAGTCACGAGTCAGTAGGAGGGTTCTTGACACATTGTGGTTGGAACTCAGTGGTGGAA 

GGGTTAGGGTTTGGTAAAGTTCCAATCTTTTTTCCGGTGTTGAATGAGCAAGGACTTAAT 

ACGAG6TTGTTGCATGGGAAAGGACTTGGT6TTGAGGTTTCAAGAGATGAGAGAGATGGG 

TCGTTTGATTCTGACTCGGTCGCTGACTCGATTAGGTTGGTGATGATTGATGATGCTGGC 

GAGGAGATAAGGGCTAAGGCTAAAGTGATGAAGGATTTGTTTGGGAACATGGATGAGAAT 

ATTCGTTATGTTGACGAACTTGTTAGGTTTATGAGAAGTAAAGGATCATCATCATCATCA 

TGA



>UGT92A1

ATGGCGGAAGCTAAACCCAGAAATCTGAGAATCGTGATGTTCCCTTTCATGGGACAAGGC 

CATATCATCCCGTTTGTAGCTTTAGCCCTTCGTTTAGAGAAGATTATGATTATGAACAGA 

GCGAACAAAACCACCATCTCTATGATCAATACTCCTTCGAACATCCCCAAfl^ 

AATCTTCCACCTGAATCCTCCATAAGTCTCATAGAGTTACCTTTCAACAGCTCTGATCAT 

GGCCTTCCTCACGACGGCGAGAATTTCGATTCTCTTCCTTACTCTCTCGTCATCAGCCTT 

CTTGAAGCTTCTAGGTCGCTTCGTGAGCCCTTTCGAGACTTCATGACGAAGATCTTGAAG 

GAAGAAGGGCAGAGCTCGGTTATAGT6ATCGGTGATTTCTTCTTGGGTTGGATCGGTAAG 

GTTTGCAAAGAGGTTGGTGTTTATTCAGTGATCTTTAGTGCTTCTGGTGCTTTTGGTTTA 

GGTTGTTATAGATCCATATGGTTAAACTTGCCACATAAAGAAACCAAACAAGATCAGTTT 

CTCTTAGATGATTTCCCTGAAGCAGGGGAGATTGAGAAAACTCAGTTGAATTCTTTCATG 

TTAGAAGCTGATGGAACCGATGATTGGTCTGTTTTCATGAAGAAGATTATACCTGGATGG 

TCTGACTTCGATGGATTCTTGTTCAACACGGTTGCTGAAATCGATCAGATGGGATTATCC 
TACTTCCGTAGAA.TAACCGGTGTTCCGGTTTGGCCAGTTGGGCCGGTTTTGAAGTCTCCG 
GATAAGAAGGTGG6ATCGAGGTCGACAGAGGAAGCAGTGAAGTCATGGCTTGACTCAAAA 
CCGGACCATTCGGTTGTGTACGTATGTTTCGGTTCAATGAACTCGATTTTGCAAACGCAT 
ATGTTAGAATTGGCTATGGCATTAGAGAGTAGCGAGAAGAACTTCATATGGGTGGTGAGG 
CCGCCCATAGGTGTGGAGGTGAAGAGTGAGTTTGATGTGAAAGGGTATCTACCGGAAGGA 
TTTGAGGAAAGAATAACAAGATCGGAAAGAGGGTTACTTGTGAAGAAATGGGCACCACAA 
GTTGATATATTGTCACACAAGGCAACATGTGTGTTTTTGAGTCATTGCGGATGGAACTCG 
ATACTCGAATCACTTAGCCACGGTGTGCCACTGCTCGGATGGCCCATGGCAGCCGAGCAG 
TTCTTCAATTCCATATTGATGGAGAAACATATTGGGGTATCGGTTGAGGTGGCGCGTGGG 
AAGAGATGTGAGATCAAATGTGATGACATTGTTTCTAAGATCAAACTGGTGATGGAGGAG 
ACTGAAGTAGGGAAAGAGATTAGGAAGAAGGCTAGAGAGGTGAAGGAGTTAGTGAGGAGA 
GCmTGGTAGATGGAGTTAAAGGTTCCTCCGTCaVTTGGTTTGGAAGAGTTTCTTGACCAA 
GCAATGGTCAAGAAAGTGGAGAATTGA
Claims 

1. A nucleic acid molecule that encodes a chimeric polypeptide wherein said 
nucleic acid molecule comprises a first part selected from the group consisting of: 

i) a nucleic acid molecule consisting of a nucleic acid sequence which 
encodes an amino acid sequence as represented in Figure 3; 
ii) a nucleic acid molecule which hybridises to the nucleic acid molecule 
in (i) and which specifically targets said chimeric polypeptide to an oil 
body; and 
iii) a nucleic acid molecule which differs from the nucleic acid molecules 
of (i) and (ii) due to the degeneracy in the genetic code; 

and a second part that encodes an heterologous polypeptide. 

2. A nucleic acid molecule according to Claim 1 wherein said second part 
encodes an enzyme. 

3. A nucleic acid according to Claim 1 wherein said second part encodes a 
pharmaceutical polypeptide. 

4. A nucleic acid molecule according to Claim 1 wherein said second part 
encodes an antigenic polypeptide. 

5. A nucleic acid molecule according to Claim 1 wherein said second part 
encodes a mussel adhesive polypeptide encoded by a nucleic acid molecule selected 
from the group consisting of: 

i) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid motif 

X-K-X-X-Y-P-X-X-Y-K, 
wherein X is any amino acid residue; 

ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of 
the residues in said motif is modified by hydroxylation; and 

iii) a nucleic acid molecule which encodes a polypeptide which has adhesive 
properties. 
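
By way of illustration only, the motif recited in Claim 5(i) can be checked mechanically against a candidate amino acid sequence. The following is a minimal sketch in Python, assuming one-letter amino acid codes and reading X as any of the 20 standard residues. The function name `find_mussel_motif` and the example sequence are hypothetical and are not taken from the specification.

```python
import re

# Assumption: X is restricted to the 20 standard amino acids (one-letter codes).
AA = "ACDEFGHIKLMNPQRSTVWY"

# X-K-X-X-Y-P-X-X-Y-K, wrapped in a lookahead so that overlapping
# occurrences of the decapeptide are also reported.
MOTIF = re.compile(rf"(?=([{AA}]K[{AA}]{{2}}YP[{AA}]{{2}}YK))")


def find_mussel_motif(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched decapeptide) for each motif occurrence."""
    seq = protein.upper()
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(seq)]


# Hypothetical test sequence used purely for illustration.
example = "MAKPSYPPTYKGG"
print(find_mussel_motif(example))
```

The sketch only tests for the presence of the motif in a translated sequence; it does not address the hydroxylation recited in (ii) or the adhesive properties recited in (iii), which are not sequence-level properties.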

6. A nucleic acid molecule according to Claim 1 wherein said second part 
encodes a spider silk polypeptide comprising a nucleic acid sequence selected from 
the group consisting of: 

i) a nucleic acid molecule consisting of the nucleic acid sequence as 
represented in Figure 4a or 4b; 

ii) a nucleic acid molecule which encodes a polypeptide domain 
comprising the amino acid sequence AGRGQGGYGQGAGG and at 
least two motifs rich in polyalanine wherein said polyalanine motifs 
comprise at least 6 alanine amino acid residues; 

iii) a nucleic acid molecule which hybridises to the sequence presented in 
(i) and (ii) above and which encodes a silk polypeptide; and 

iv) nucleic acid sequences that are degenerate as a result of the genetic 
code to the nucleic acid sequence defined in (i) - (iii) above. 
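
As a hedged illustration of the sequence features recited in Claim 6(ii), the sketch below tests a translated polypeptide for the recited 14-residue sequence and for polyalanine motifs. It assumes that a "polyalanine motif" means an uninterrupted run of at least six alanine residues, which is one possible reading and is not stated in the claim. The function name `has_silk_domain_features` is hypothetical.

```python
import re

CORE = "AGRGQGGYGQGAGG"  # amino acid sequence recited in Claim 6(ii)

# Assumption: a polyalanine motif is a run of >= 6 consecutive alanines.
POLY_ALA = re.compile(r"A{6,}")


def has_silk_domain_features(protein: str) -> bool:
    """True if the sequence contains CORE and at least two polyalanine runs."""
    seq = protein.upper()
    has_core = CORE in seq
    # findall returns non-overlapping maximal runs, so one long run counts once.
    n_poly_ala = len(POLY_ALA.findall(seq))
    return has_core and n_poly_ala >= 2
```

A call such as `has_silk_domain_features(translated_sequence)` could serve as a first-pass screen; hybridisation under (iii) and degeneracy under (iv) are defined at the nucleic acid level and are outside the scope of this sketch.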

7. A nucleic acid molecule according to Claim 1 wherein said second part 
encodes a polypeptide with glucosyltransferase activity and is encoded by a nucleic 
acid molecule selected from the group consisting of: 

i) a nucleic acid molecule selected from the group consisting of nucleic acid 
sequences as represented in Table 1; 
ii) a nucleic acid molecule which hybridises to the sequences represented in (i) 
above and which has glucosyltransferase activity; and 
iii) nucleic acid molecules consisting of nucleic acid sequences which are 
degenerate as a result of the genetic code to the sequences defined in (i) and 
(ii) above. 
8. A nucleic acid molecule according to any of Claims 1-7 wherein said 
molecule encodes a cleavage site to facilitate the purification of said second part 
from said first part. 

9. A nucleic acid molecule according to Claim 8 wherein said cleavage site is a 
proteolytic cleavage site. 

10. A chimeric polypeptide encoded by a nucleic acid molecule according to any 
of Claims 1-9. 

11. A vector comprising a nucleic acid molecule that encodes a chimeric 
polypeptide according to Claim 10. 

12. A cell transfected or transformed with a nucleic acid molecule according to any 
of Claims 1-9 or a vector according to Claim 11. 

13. A cell according to Claim 12 wherein said cell is a plant cell. 

14. A method to produce a chimeric polypeptide according to Claim 10 
comprising the steps of: 

i) providing a cell according to Claim 12 or 13 and growth conditions 
conducive to the production of a chimeric polypeptide according to 
the invention; and optionally 

ii) purifying said chimeric polypeptide from said cell or growth media. 

15. A plant comprising a cell according to Claim 13. 

16. A seed comprising a cell according to Claim 13. 
17. A vessel comprising a cell according to Claim 12 or 13. 

16. A vessel comprising a polypeptide according to Claim 10. 

17. A vessel according to Claim 16 wherein said vessel is a bioreactor. 

18. A vessel according to Claim 17 wherein said bioreactor is a fermentor. 

19. A vaccine preparation comprising a cell, or a fraction of a cell, according to 
Claim 12 or 13 or a plant according to Claim 15. 

20. A preparation according to Claim 19 wherein said preparation is an oil body 
containing fraction. 

21. A preparation according to Claim 19 or 20 wherein said preparation includes 
at least one carrier. 

22. A preparation according to Claim 20 or 21 wherein said preparation includes 
at least one adjuvant. 

23. A method to immunise an animal to at least the second part of the chimeric 
polypeptide according to Claim 10, comprising administering an effective amount of 
a chimeric polypeptide or preparation according to any of Claims 19-22 sufficient to 
stimulate an immune response to said second part. 

24. A method according to Claim 23 wherein said animal is human. 

25. An antibody obtainable by the method according to the invention. 

26. An antibody according to Claim 25 wherein said antibody is a monoclonal 
antibody or binding fragment thereof. 
27. A method for preparing a hybridoma cell-line producing monoclonal 
antibodies according to Claim 26 comprising the steps of: 

i) immunising an immunocompetent mammal with a chimeric 
polypeptide or preparation according to any previous Claim; 
ii) fusing lymphocytes of the immunised immunocompetent mammal 
with myeloma cells to form hybridoma cells; 

iii) screening monoclonal antibodies produced by the hybridoma cells of 
step (ii) for binding activity to polypeptide of the invention; 

iv) culturing the hybridoma cells to proliferate and/or to secrete said 
monoclonal antibody; and 

v) recovering the monoclonal antibody from the culture supernatant. 

28. A hybridoma cell-line obtainable by the method according to Claim 27. 

29. A method to prepare a vaccine preparation according to any of Claims 19-22 
comprising: 

i) forming a cell extract from a plant cell culture or plant according to 
the invention; and 
ii) separating said extract into an oil body rich fraction. 



30. A method according to Claim 29 wherein said extract is incubated with a 
protease that cleaves said second part from said first part. 



Figure 1a 1/4 
CTAAAGCCATGGATGATGCTCGTAAAATCACTTCCACAAGCC^^ 

ctit:ttggacctgttcaagcacattgtgctgagt<3Attt^ 

GATCAGAGAGTGCCTGCTACGGCAGCATATTATAGCAGGTGGCCTGTTTCAGTT^ 

TTCAACTTTTCCAGATGCCAGCCGCGATGCTTGGTCATCTTACTGATTTCTTO^ 

GAATCATGGCTTCCTTGGCATACTCAGAAACATCTTCTTAATAAGACTGAAGATACCAAA^ 

GCCGACTTTATAAGCACGATAGGGTATTTAGATTOICGAATOGACCTTCACGGGACG 

ACCAGGCAGACGAAGTGATTTCAAATGCAGATAATCCAAGCCTGAAAGAAGGGCAC^ 

AGGAGCCCTTGGGAACCGATCTCTCATGGATCTTTGTATCATGGCGTCAAAGCTTGCTTATOAA^ 

AAAGTTGOTCAAAGAGTAGTTGCCGAACATTGGAAGATGCATTTCGTGGCTGACTATGM^ 

ATTTCC^GATGCAAGGAAC^^CTCATGCGTTCATCTTTTGTGAC^ 

GATCAGCITCAGAGGCACAGQACCTTTTAGTATACCAAATTGGTGT^ 

GGGTTGGGAGACGCAGGAAGTGTCCATGTTGGATTCTTAGAAGCAATGGGTTTGGGTC^ 

TTTCCAGCTTTGAGACTAGCATTi^CACAAAGTCGCCAGGAAGCATAACC 

GATGGCTCCGGACCACTTGGTATGGGCATATGATGGTGTTTACTTTCTTGCGGC^ 

TTACTiU^GACCACAAGAACGCAAAATTTGTAGTCACTGGGCATAGCT^^^ 

TGTTCACATGCATTCTTGAGATACyiGa^GGAGACyVGAGGTGCTTGACA^ 

CGGACAGOTAGGATTGGGAACTATAATCTTGGTTACTTCATGCAGAACCGTCTC^TTTTC^^ 
AGGTATTTCAGGGTGGTTTACTGCAATGACATGGTTCCTAGGGTGCCTTTCGATGATGTCTTCTTC^^ 

TCGAGCATITCGGAACCTO'cATTTACTATGATAGCCGCTTCTTTGGCTACm 

CAGAAACCCTTTCGGAATAGAAAATGCCATCAGTGCGCACATCACCGCCTGGTGGGAGCTCTGGA^ 

TTCATATTAAATCACGTATATGGCGCAGAATACAAGGAGACCTGGGAATCCAGAATGTTCAGGATAOT^ 

GACTGTTTCTCCCTGGTGTTGCAGCTCATAGTCCTGTGAATTATGTCAATTCTGTCAGGCTTGG^^ 

GCTTGCAATTCCCTTGATGTCTCTGAAAATGATGGdlCAAGGTTACT^^ 

GAGATGATCyVTTAATGATAAATGGTTCACTCTTTGCCA^^ 

TTGTAATAAAAGGATAGCTGTTTCATGAACAGGTCGCCTAGGGTTGTGGTGTGGAG 

TATGCATATATGGCCTGTTTGTTTGTCAGTTTGTTTTTCTCm 
TAT^AAAGGAACGTTGAGTAGTTTTTOGGTTGCCAA^ 

Figure 1b >RcOBL1 

MDDAGklTSTSHLIVSPDEGTFI^DLFKHIVLSDLGSGAKFFRASDQRVPATAA^^ 
pQMPAAML^g^ 

DBVISNADNPSLKEGHNSKIKGALGNRSLMDLCIMASKIAYENTKVVERWA^ ' 

D^JiNTHAFIPCDKPKDAMLIviSFRGTGPFSIPI^ 

FETSIOTKSPGSITELRKESBMAPDHLWATOGVYFL^ 

CILEIQQETEVLDRLIiNVYTFGQPRIGNYNLGYFMQNRLNFPERRYFRVVYCNDMVPRW 
FGTCIYraSRFFGYFTKEEPSRNPFGIENAISiUilTAWWELWRSFILNHVYGAEYK^ 
LPGVAAHS PVNYVNSVRLGRBLAIPLMSLKMMAQGY 
Figure 2 2/4 

GATCTTCAAAGTAGTTTTGAACTCTCTGCTGAGGGAAAAAAAGAGOXS 

ATGGCTGCTTCTGCTACTACTAGCAATAATATTGCTCCAAACTTCTTGGTTGT^ 

GAAAAAGAGACATATTCAAGTATTTGGTGAGGi^GATGTQAAGAGTGGAATGAG^ 

AGAGGAAGGAGTTAAAGGTGGCGCAGCAGTTGATCATAGGTGGATTTTATTGGTTTCTA^ 

AGGGTTCTTGCGCTTATTGATACCCCATTAAAGTACCTTGGATATGTCATTGATTTCTTTCTC^ 

TCTCCCAAAATAGTGGATTCTCTOGCATACTCAAa^CT^ 

AGGAACyVGAGAATTTTATAAGCACGATTGGGCAATTGGATCGGCG 

TTATCGGAGAAAGTAGATGATTCTGTTGCTACTGATGTTAACAACATTAAAGCAGAACTGGGTAATCQAT 

ATCTCATGGATCTTTGTATCATGGCAGCCAAACTTGTCTATGAGAATGAGAAAGTTGCTCAAAATGTTGT 

TGATCGTCyVCTGGAAGATGCATTTTGTGGCTTTCTACAACTGCTGGAATGAGTACa\^^ 

ACCCAAGTGTTCATATGTTGTGACAAGCCAAAGGATGCy^TTTGATAGTGGTCAGC^^ 

AACCATTTAATGCU^CAAGATTGGAGTACGGATTTTGATTTCTCGTGGTATGAAATCCCAAAM 

GATCCATATTGGATTCTTAGAAGCTTTAGGTCTGGGCAACAGAAGTGACGCTACCACTTTCCAAACTCAC 

CTTCAGAGGAAACATACAGGTTTCTTCCATCTAAATGGTGAGTCTGAAGGCAATATGACGGAATGGGCAA 

AGAaGAGTGCATACTATGCTGTCGCGTTGAAGCTAAAGAGCTTACTGAAAGAACACAGGi^ 

TATAGTCACTGGACATAGTTTAQGTGOAGCACTTGCAATATTGTTCCCGTCAATACTGGTO 

GAGACAGAGATGCTAAACAGGTTGCTGAACATATACACATTTGGGCAGCCAAGAATTGGAGAT6CACAGC 

TTGGAACTTTCATGGAGTCCCACTTGAATTATCCAGTTACTAGATACTTCAGGGTTGTTTACTGCA^ 

TATGGTGCCTAGAGTGCCTTTCGATGACAAGATTTTCGCTTTCAAGGATTTOSGTACATGTCra 

GATAGCCGCTACTTTGGCCGATTTATGGATGAGGAGCCGAACAGAAATTATTTTGGACTGAGACACATAA 

TTCCAATGTCGGTGAATGC^TTATGGGAACTATTCAGAAGTTTTATGATAACCCATC 

CTACCAGGAGAGTTGGTTCTGCACTCTTTCCAGGGTAGCAGGACTGGTGCTTCCTGGTGTTGCTGCT 

AGTCCTATAGATTATGTTAATTCAGTTAGGCa?TGGAAAGGAGAGAGTAGCTC(:7^TGACATCCTTGAAA^ 

GCTTCGCTCGCAAGTCATAAATCTGGGTTGCACTTGTACTCTTCTTCATGGATO^ 

GQAAAATAATAACAGGGTGCyVGTTTAAAATGATCATAAGGGAAAT 

GAAATTTGATAATCTGTGACCTTGTGGTTGTGGGTAGTTCC^ 

ATCCTGTACTTCGGTGATAATATGAATTATAGTGTGACTTTTTTGGTTGCCCATAGAi^^ 
AAA?UUiAAAA 

Figure 2b >RcOBL2 

MAASATTSNNIAPNFLVVDPKKGRKRDIFKYLVRK^ 

RVLALIDTPLKYLGWIDFFLNLISQNSGFSGILNNFLHGNLKIPRRGTENFISTIGQLDGRIDLYRTTI 
LSEKVDDSVATDVKNIKAELGNRYLMDLCIMA^ 

TQVFICCDKPKDi^LIwiSPRGTEPFNAQDWSTDPDFSPraSIPKVGKIHIGFLEALGL^ 

LQRKHTGFFHLNGESEGimTEWAKKSAYYAVALKLKSLLKEHRNAKPIVTGHSLGGA^ 
ETEMLNRLLNIYTFGQPRIGDAQLGTFMESHLNYPVTRYFRVVYCl^ 
DSRYFGRFMDEEPNRNYFGLRHliPMRVNALWELFRSFMITHAHGPp 
SPIDYVNSVRLGKERVAPMTSLKSFARiCS 



Figure 3 >RcOBL1 core hydrophobic region (50-105) 
ATAAYYSRWPVSVFICKILQLFQMPAAMLGHLTDFLLNPyYQiraGFLGIIJ^ 
Figure 4a 3/4 
5' - 

GCCGCAGCAGCAGCTGGACAGGGCGGTCAAGGTGGATATGGTGGACTAGGTCA^ 

TTCTACAGCCGCCGCOSCCGCAGCCGCCGCCGCCGCCGCAGGTGGACaAGGTGGACAAGGTCAAGGAGGAT^^ 
GTGCAGGAAGTTCn^CAGCCGCCGCCXSCTGCaiGC^^ 
CAAGGTTCTGGAGGTAATCCCGCTGCAGCAGCCGCTGCAGCTGC^^ 
ACAAGGTCAAGQTGGATATGGACAAGGTGCAGGAATTTCTO 

CAGCTGGACGAGGTCAAGGAGGATATGGTCAAGGTGCTGGAGGTAATGCCGCTGCAGCAGCCGCAGCAGCTGCCGCCGCC 

GCCGCAGCIX^GACAGGGCGGTa^GGTGGATATGGTGGACA^ 

AGCCGCTGCCGCAGCAGCAGCTGGACGAGGTCAA - 3' 
Figure 4b 4/4 
5' - 

GCGGCCGCGGCGGCAGCGGCGGCAGCGGCGGCAGGGGGACAAGGTGGACAAGGTCAAGGAGGATATGGACMGGTC 

AAGTTCTGCAGCCGCCGCCGCTGCAGCAGCAGCAGCaGCAGCCGCAGCAGaTOGACGAGGTC^ 

GTTCTCGAGGTAATGCCGCTGCAGGAGCCGCTGa^GTTGCAG^^ 

GGTCAAGGTGGATATGQACAAGGTQCAGGAAGrrCTGCAGCTO 

TGGACGAGGTCAAGGAGGATATGGTOVAGGTGCTCGAGGTAATGCCGCTGC^^ 

CAGCTGGACy^GGGCGGTCAAGGTGGATATGGTGGACyVAGGTCTAGGAGGATATGGACAAGGTGCyVGG^^ 
GCTGCCGCOKIAGCAGCaiGa^GCCGCCGCAGCA^ 
CGCTGC'AGCAGCCGCCGCAGCAGGaX3GAC?UVGGTC^ 
CCGCTGCCGCAGCAGCAGCTGGA - 3' 



